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Abstract. We introduce a general method to count unlabeled combinatorial structures and to 
efficiently generate them at random. The approach is based on pointing unlabeled structures 
in an "unbiased" way that a structure of size n gives rise to n pointed structures. We extend 
Polya theory to the corresponding pointing operator, and present a random sampling framework 
based on both the principles of Boltzmann sampling and on Polya operators. All previously 
known unlabeled construction principles for Boltzmann samplers are special cases of our new 
results. Our method is illustrated on several examples: in each case, we provide enumcrativc 
^— ^ ' results and efficient random samplers. The approach applies to unlabeled families of plane and 

, nonplane unrooted trees, and tree-like structures in general, but also to families of graphs (such 

as cacti graphs and outerplanar graphs) and families of planar maps. 

This is the extended and revised journal version of a conference paper with the title "An 
unbiased pointing operator for unlabeled structures, with applications to counting and sam- 
pling", which appeared in the Proceedings of ACM-SIAM Symposium on Discrete Algorithms 
(SODA '07), 7-9 January 2007, New Orleans. 
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1. Introduction 



Pointing (or rooting) is an important tool to derive decompositions of combinatorial structures, 
with many applications in enumcrativc combinatorics. Such decompositions can for instance be 
used in polynomial-time algorithms that sample structures of a combinatorial class uniformly 
at random. For a class of labeled structures, pointing corresponds to taking the derivative of 
s ! ' the (typically exponential) generating function of the class. In other words, each structure of 

size n gives rise to n pointed (or rooted) structures. Other important operations on classes of 
combinatorial structures are the disjoint union, the product, and the substitution operation - they 
1^ , correspond to addition, multiplication, and composition of the associated generating functions, 

■^j- ■ Together with the usual basic classes of combinatorial structures (finite classes, the class of finite 

sets, the class of finite sequences, and the class of cycles), this collection of constructions is a 
■ powerful device to define a great variety of combinatorial families. 

If a class of structures can be described by recursive specifications involving pointing, disjoint 
unions, products, substitutions, and the basic classes, then the techniques of analytic combinatorics 
can be applied to obtain enumerative results, to study statistical properties of random structures 
in the class, and to derive efficient random samplers. An expository account for this line of 
research is [10] . Among recent developments in the area of random sampling are Boltzmann 
samplers [9] , which are an attractive alternative to the recursive method of sampling [251 111] . 
Both approaches provide in a systematic way polynomial-time uniform random generators for 
decomposable combinatorial classes. The advantage of Boltzmann samplers over the recursive 
method of sampling is that Boltzmann samplers operate in linear time if a small relative tolerance 
is allowed for the size of the output, and that they have a small preprocessing cost, which makes 
it possible to sample very large structures. 

A third general random sampling approach is based on Markov chains. This approach does not 
require recursive decompositions of structures, and is applicable on a wide range of combinatorial 
classes. However, Markov chain methods are mostly limited to approximate uniformity. Moreover, 
it is usually difficult to obtain bounds on the rate of convergence to the uniform distribution [20] . 

All the results in this paper concern classes of unlabeled combinatorial structures, i.e., the 
structures are considered up to isomorphism. In the case of the class of all graphs, the labeled and 
the unlabeled model do not differ much, which is due to the fact that almost all graphs do not have 
a non-trivial automorphism (see e.g., [5j Ch. 9.] and [U])- However, for many interesting classes of 
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combinatorial structures (and for most of the classes studied in this paper) , the difference between 
the labeled and the unlabeled setting does matter. 

The Markov chain approach can be adapted also to sample unlabeled structures approximately 
uniformly at random, based on the orbit counting lemma }19) . Again, the rate of convergence is 
usually very difficult to analyze, and frequently these Markov chains do not have a polynomial 
convergence rate [15]. For unlabeled structures, the approach faces additional difficulties: to 
computationally implement the transitions of the Markov chain, we have to be able to generate 
structures with a given symmetry uniformly at random (this is for instance a difficult task for 
planar graphs) , and we have to be able to generate a symmetry of a given structure (for the class 
of all finite graphs, this task is computationally equivalent to the graph isomorphism problem). 

To enumerate unlabeled structures, typically the ordinary generating functions are the appro- 
priate tool. Disjoint unions and products for unlabeled structures then still correspond to addition 
and multiplication of the associated generating functions as for labeled cases. Boltzmann samplers 
for classes described by recursive specifications involving these operations have recently been de- 
veloped in |13j . However, the substitution operation for unlabeled structures no longer corresponds 
to the composition of generating functions, due to the symmetries an unlabeled structure might 
have. This problem can be solved by Polya theory, which uses the generalization of generating 
functions to cycle index sums to take care of potential symmetries. Polya theory provides a com- 
putation rule for the cycle index sum associated to a substitution construction. The presence of 
symmetries leads to another problem with the pointing (or rooting) operator: the fundamental 
property that a structure of size n gives rise to n pointed structures does not hold. Indeed, if 
a structure of size n has a non-trivial automorphism, then it corresponds to less than n pointed 
structures (because pointing at two vertices in symmetric positions produces the same pointed 
structure). Thus, for unlabeled structures, the classical pointing operator does not correspond to 
the derivative of the ordinary generating function. 

In this paper, we introduce an unbiased pointing operator for unlabeled structures. The op- 
erator is unbiased in the sense that a structure of size n gives rise to n pointed structures. It 
produces cycle-pointed structures, i.e., combinatorial structures A together with a marked cyclic 
sequence of atoms of A that is a cycle of an automorphism of A. Accordingly, we call our oper- 
ator cycle-pointing. The idea is based on Parker's lemma in permutation group theory [7] (this 
will be discussed in Remark 13. ip . We develop techniques to apply this new pointing operator to 
enumeration and random sampling of unlabeled combinatorial classes. The crucial point is that 
cycle-pointing is unbiased. As a consequence, performing both tasks of enumeration and uniform 
random sampling on a combinatorial class is equivalent to performing these tasks on the associated 
cycle-pointed class. 

To understand how we use our operator, it is instructive to look at the class of free trees, i.e., 
unrooted and nonplane trees (equivalcntly, acyclic connected graphs). Building on the work of 
Cayley and Polya, Otter |26j determined the exact and asymptotic number of free trees. To this 
end, he developed the by- now standard dissimilarity characteristic equation, which relates the 
number of free trees with the number of rooted nonplane trees; see |16j . The best- known method 
to sample free trees uniformly at random is due to Wilf [33] , and uses the concept of the centroid of 
a tree. The method is an example of application of the recursive method of sampling and requires 
a pre-processing step where a table of quadratic size is computed. 

Cycle-pointing provides a new way to count and sample free trees. Both tasks are carried out on 
cycle-pointed (nonplane) trees. The advantage of studying cycle-pointed trees is that the pointed 
cycle provides a starting point for a recursive decomposition. In the case of cycle-pointed trees, 
we can formulate such a decomposition using standard constructions such as disjoint union, prod- 
uct, and substitution (which requires to suitably adapt these constructions to the cycle-pointed 
framework). We want to stress that, despite some superficial similarities, this method for count- 
ing free trees is fundamentally different from the previously existing methods mentioned above, 
and it proves particulary fruitful in the context of random generation. Indeed, the dissimilarity 
characteristic equation |26j and the dissymmetry theorem [2] both lead to generating function 
equations involving subtraction. However, subtraction yields massive rejection when translated 
into a random generator for the class of structures (both for Boltzmann samplers and for the 



BOLTZMANN SAMPLERS, POLYA THEORY, AND CYCLE POINTING 



3 



recursive method). In contrast, the equations produced by the method based on cycle-pointing 
have only positive signs, and the existence of a Boltzmann sampler for (cycle-pointed) free trees, 
with no rejection involved, will follow directly from the general results derived in this paper. As 
usual, the Boltzmann samplers we obtain have a running time that is linear in the size of the 
structure generated, and have small pre-processing cost. 

Similarly, we can decompose plane and nonplane trees, and more generally all sorts of tree- 
like structures. By the observation that the block decomposition of a graph has also a tree-like 
structure, we can apply the method to classes of graphs where the two-connected components can 
be explicitly enumerated. This leads to efficient Boltzmann samplers, for instance, for cacti graphs 
and outerplanar graphs, improving on the generators of [3]. Further, our strategy is not limited 
to only tree-like structures, but can also be applied to other classes of structures that allow for a 
recursive decomposition. To demonstrate this, we sketch how the method can be applied to count 
and sample certain classes of planar maps. 

Outline of the paper. To formalize our general results on enumeration and sampling for classes 
of unlabeled combinatorial structures in full generality, we apply the concept of combinatorial 
species; we recall this concept in Section [2] In Section [3] we introduce cycle-pointed species and 
the cycle-pointing operator. Section 2] is devoted to applications of our cycle-pointing operator 
in enumeration. The technique in which we use the operator to obtain recursive decomposition 
strategies for unlabeled enumeration is very generally applicable; we illustrate it by the enumer- 
ation of (unrooted) non-plane and plane trees, cacti graphs, and maps. Finally, in Section [5j 
we present how to apply the concepts introduced in this paper to obtain highly efficient random 
sampling procedures for unlabeled structures. This is illustrated by applications for sampling of 
concrete and fundamental classes of unlabeled combinatorial structures; several of these concrete 
sampling results are either new or improve the state-of-the-art of sampling efficiency. 

2. Preliminaries 

In this paper we work with classes of combinatorial structures, such as graphs, relational struc- 
tures, functions, trees, plane trees, maps, words, terms, or permutations. There are many ways to 
define formally these objects; for example, rooted trees can be coded as special types of directed 
graphs, but also as terms. Which formal representation of a combinatorial structure is best usually 
depends on the application. 

When we are interested in combinatorial enumeration, the differences in representation are not 
essential; in the example above, we have the same number of rooted trees, no matter how they 
are represented. Wc would thus like to have a formalism that is sufficiently abstract so that our 
results apply to broad classes of combinatorial structures. At the same time, we would like to have 
a formalism for classes of combinatorial structures that supports fundamental construction oper- 
ations for combinatorial classes, such as formation of disjoint unions, products, and substitution, 
and allows to state general results about enumeration and random sampling. 

The theory of combinatorial species is an elegant tool that fully satisfies the needs mentioned 
above. We give a brief introduction to species, and refer to Bergeron, Labelle, and Leroux [2] 
for a broader treatment of the topic. The fundamental and well-known classes of combinatorial 
structures that we treat in Section @] provide ample illustration of the concepts wc define in this 
section. 

2.1. Combinatorial Species. We closely follow the presentation in [2]. A species of structures 
is a functor A from finite sets U to finite sets -A[£7], together with a rule that produces for each 
bijection a : U — > V a function from A[U] to -4[V]. Slightly abusing notation, this function will 
also be denoted by A[a\. The functions A[a] must satisfy the following two (functorial) properties: 

• for all bijections a : U —> V and r : V — >• W, 

A[too] = A[t] o A[a] ; 

• for the identity map Idu : U — > U, 

A[Idu) = ld A[u] . 
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An element A G A[U] is called an A- structure on U . The elements of U are called the atoms of 
A[U], and we refer to \U\ as the size of A. The function A[cr] is called the transport of 71-structures 
along cr. We write a • A for .A[cr](.A). 

Definition 2.1. Let A\ G -A[J7] and A2 G .A[V] be two A-structures. A bijcction cr : U V is 
called an isomorphism from Ai to A 2 if A 2 = a • A\ = A[o~](Ai). An isomorphism from A\ to A\ 
is called an automorphism of Ai . 

The advantage of species is that the rule A that produces the structures A[U] and the transport 
functions A[a] can be described in any way; the book [2] gives instructive examples where this 
description is by axiomatic systems, explicit constructions, algorithms, combinatorial operations, 
or functional equations. 

Definition 2.2. A species A is a subspecies of a species 23 (and we write A C 23) if it satisfies the 
following two conditions: 

• for any finite set U, A[U] C £[{/]; 

• for any <r : ?7 — >• we have A[<r] = 23 [cr] | s ryi . 

2.2. Enumeration. For all finite sets U, the number of A-structures on U depends only on the 
number of elements of U (and not on the elements of U). If A is a species, we write a n for 
|-A[{1, . . . , n}]\, the cardinality of the set of A-structures on {1, . . . , n}. The series 

A(x) := ~fnx n (1) 

n>0 

is called the exponential generating series of the species A. 

In this article we focus on unlabeled structures, i.e., we consider structures up to isomorphism. 
We may restrict ourselves to structures on sets of the form U = [l..n] := {1, . . . , n}, and write 
A[n] for A[{1, . . . , n}]. We define the equivalence relation ~ on A[n] by setting A\ ~ A2, for 
A\,A2 G A[n], if and only if there is an isomorphism cr : [l..n] —> [l..n] from A\ to A2. We also 
say that A\ and A2 have the same isomorphism type, and the equivalence classes of A-structures 
on [l..n] with respect to ~ are also called unlabeled A-structures of size n, The set of all those 
equivalence classes is denoted by A n , and we write a n for its cardinality \A n \ . The series 

A(x) := J2"nx n (2) 

n>0 

is called the ordinary generating series (OGS) of A. We use the classical notation [x"]A(a;) to 
denote the n-th coefficient a n in the power series A{x). 

2.3. Cycle index sums. For a species A and each n > 0, a symmetry of A of size n is a pair 
(A, cr) where A is from A[n] and a is an automorphism of A. We call A the underlying structure of 
the symmetry (A, a). Notice that the automorphism a can be the identity. We denote by Sym(A) 
the species defined by 

Sym(yi)[n] := {(A, a) | A G A[n] and cr is an automorphism of A} . 

The definition of the transport of Sym(A) is obvious (the definition of symmetries as an auxiliary 
species is standard; see Section 4.3 in [2]). The weight-monomial of a symmetry (A, a) of size n 
is defined as 

1 " 

' i=l 

where, for i G [1..7i], Sj is a formal variable and Ci{a) is the number of cycles of a of length i. For 
simplicity, in the following we will write Ci for Ci(a) if the corresponding automorphism is clear 
from the context. The cycle index sum of A, denoted by Zji(sx, S2, ■ ■ •), or shortly Zji, is the 
formal power series defined as the sum of the weight- monomials over all the symmetries of A, 

Z A (s 1 ,s 2 ,...):=^2\ W (A,a) J • ( 4 ) 

n>0 \(A,a)eSym(A)[n] I 
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Figure 1. The cycle index sums of basic species and of species composed by +, 
•, and o. 



Cycle index sums for classes of combinatorial structures have been introduced by Polya [28] . The 
following fact, which is based on Burnside's lemma, shows that cycle index sums are a refinement 
of ordinary generating series. 

Lemma 1 (Polya). Let A be a species of structures. For n > 0, each unlabeled structure A G A n 
gives rise to n\ symmetries, i.e., there are nl symmetries (A, a) such that A € A. Hence, 

Z A {x,x 2 ,x 3 ,...) =A{x). (5) 

In the proofs of combinatorial identities it will be convenient to identify species that are essen- 
tially the same. 

Definition 2.3. Let A and 23 be two species. An isomorphism from A to 23 is a family of bijections 
ajj : A[U] — > H[U] which satisfies the following naturality condition: for any bijection a : U — > V 
between two finite sets U and V , and for any .A-structure A on U, one must have 

a • old (A) ~ ay(o- • A) . 

It is known (see [2]) that when A is isomorphic to 23, then A(x) = 23(x), A(x) = 23(a;), and 
Zj\(x\,X2, ■ ■ ■ ) = Z'b(xi,X2, ■ ■ ■ )■ Hence, for the purposes of combinatorial enumeration we can 
even identify isomorphic species, and as in [2] we write A = 23 when A and 23 are isomorphic 
species. 

2.4. Basic species and combinatorial constructions. We now recall a collection of basic 
species and combinatorial constructions. We start with the description of some basic species, and 
then introduce the three fundamental constructions of disjoint union, product, and substitution. 

2.4.1. Basic Species. The most elementary species are 

• the empty species defined by 0[U] := for all U; 

• the neutral species 1 defined by 1[U] := {0} if \U\ =0 and 1[U] := otherwise; 

• the atomic species X defined by X[U] := {U} if \U\ = 1 and X[U] := otherwise; 

• the species of sets Set defined by Set[[7] := {U}; 

• the species Seq of sequences defined by Seq[C7] := {([/, <) | U is linearly ordered by <}; 
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• the species Cyc of oriented cycles (or cyclic permutations) denned by 

Cyc[{7] := {(U, 7r) | 7r is a cyclic permutation of U} . 

In each case, the definition of the transport A[a] of yi-structures is obvious. 

If 21 is a species, then 21 ^ denotes the species defined by A^[U] := A[U] if \U\ = n and 
A[U] := otherwise. The transport A [n] [a] for bijections a : U -> V is defined by A [n] [a] := A[a] 
if \JJ\ = n and ./^[cr] := Id^rp-] otherwise. In particular, X is Set' 1 '. 

2.4.2. Constructions. We now describe the three fundamental constructions that we use to con- 
struct species from other species. 

Disjoint union. For two species A and 23, the species .A + 23, called the disjoint union (or sum) 
of A and 23, is defined by (A + "B)[U] = ({1} x A[U}) U ({2} x 23 [£/]). The transport along a 
bijection a : U — > V is carried out by setting, for any (A + 23)-structure C on U, 



(A + "B)[a}((i,C)) := 



A[a]{C) i£i=l 
23[ct](C7) ifi = 2 



Note that when ^/l, ./I', 23, 23' are species such that A = A' (i.e., .A and A' are isomorphic) and 
23 = 23', then A + 23 = A' + 23'. 

The disjoint union of a countably infinite sequence Ai,yi2, ... of species is defined analogously, 
setting C%2i>iAi)[U] := Ui>i(^-^j[^]) ( tms species is well-defined provided the set on the right 
is finite for each finite set U). 

Product. The cartesian (often also called partitional or dinary) product A • 23 of two species A 
and 23 is the species defined as follows. An (A • 23)-structure on U is an ordered pair C = (A, B) 
where A is an ^.-structure on a set U\ C U, B is a 23-structure on a set £/ 2 C U, XJ\ n U2 = 0, 
and Ui U U2 = U . The transport along a bijection a : U — > V is carried out by setting, for each 
(A ■ 23)-structurc C = {A, B) on U, 

(A-%)[a](C) := (A[o- 1 ](A),'B[a 2 ](B)) 

where Oi is the restriction o\u i of a on Ui for i g {1,2}. 

Again we note that when A, A 1 , 23, 23' are species such that A = A' and 23 = 23', then A • 23 = 
yi'-23'. 

Substitution. Given two species ^4. and 23 such that 23 [0] = 0, the (partitional) composite of 23 in 
A, denoted by A a 23, is the species C obtained as follows. A C-structure on U is a triple (tt, A, B) 
where 

• 7r is a partition of U; 

• A is an yi-structure on the set of classes of tt; 

• B := (B p )p£i T where for each class p of tt, B p is a 23-structure on p. 

The transport along a bijection a : U — > V is carried out by setting, for any C-structurc C = 
(it, A, (B p ) pe7T ) on U 

G[a](C) := (M,(%)f>er) 

where 

• 7r is the partition of V obtained by transport of tt along er; 

• the structure A is obtained from the structure A by A-transport along the bijection a : 
tt — > 7f induced by <r on 7r; 

• for each p = o(p) € tt, the structure Bp is obtained from the structure B p by 23-transport 
along a\ p . 

We call the ,/l-structure A the core of (-7T, A, B), and B p in B = (Bp) p&7T a component of (71", A, B). 
Also for the substitution construction, we have that when A, A', 23, 23' are species such that A = A' 
and 23 = 23', then A o 23 = A' o 23'. 

Together with the basic species, these constructions provide an extremely powerful device for 
the description of combinatorial families. The substitution construction is particularly interesting, 
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as it allows us to express other combinatorial constructions, such as the formation of sequences, 
sets, and cycles of structures of a species A, which are specified as Seqo.A, Seto^I, and CYCoA, 
respectively. 

As usual, and to avoid clumsy expressions with many brackets, we make the convention that in 
expressions involving several of the symbols +, -,0, the symbol o binds stronger than the symbol 
•, and the symbol • binds stronger than the symbol +. 

There are explicit rules to compute the cycle index sum for the basic species and for each 
construction. 

Definition 2.4. Given two power series / := f(xi, £2,^3 ■ • ■) and g := g(xi,X2,Xa, ■ ■ ■ ) such that 
5(0,0, . . .) = 0, the plethystic composition of / and g, as defined in [2], is the power series 

f°9 ■= f(9i, 52,53, •••)) with.% = g(x k ,X2k,x 3k ,---) • (6) 
In other words, / o g is the series / where each variable x k is replaced by g(x k , X2fc, • ■ •)■ 

Proposition 2 (Polya, Bergeron et al. 0). For each of the basic species {0,1, X, Seq'^, Set^, 
Cyc'^, Seq ; Set, Cyc}, the associated cycle index sum has an explicit expression, as given in 
Figure [7J For each of the fundamental constructions A G there is an explicit rule to 

compute the cycle index sum of the species A AH, as given in Figured 

Remark 2.1. The ordinary generating series of a species can be obtained from the cycle index 
sums for the species. For the sum and product constructions, we obtain 

G = A + -B =*> e(x)=A(x) + %(x), (7) 

e=A-T> =*> G(x) =A(x) -^(x). (8) 

For the substitution construction, the computation rule is 

e=AoT, G(x) = Z A (T,(x),'B(x 2 ),'B(x :i ),...). (9) 

2.5. Recursive Specifications. It is possible to define species via recursive specifications that 
involve the fundamental constructions introduced above. 

Definition 2.5. A (standard) recursive specification with variables Xi, . . . ,x m over the species 
Ai, . . . ,Ag is a system "J of equations x\ = ei, . . . , x m = e m where each ej is 

• of the form a + b or a • b with a,b € {x\, . . . , x m ,Ai, . . . ,Ai}, or 

• of the form aob with a € {Ai, . . . ,Ai} and b € {xi, . . . , x m ,Ai, . . . , Ag }. 

Under a certain condition, a recursive specification "J with variables xi, . . . ,x m defines new 
species Xi, . . . , X m as follows. We first define for each i > a vector of species %\ , . . . , X„- For 
i = 1, we set (X|j , . . . , !„) = (0, . . . , 0). For i > 1 and 1 < j < m, the species %y is defined 

from ej by substituting for all 1 < k < m the occurrences of x k in ej by xl* 1 . The resulting 
expression only contains species and symbols +, •, and o, and hence evaluates to a species, unless 
ej = a o b and b is substituted by a species that contains structures of size 0. If this case never 
occurs, i.e., if 25 [0] =0 whenever a species 25 is substituted for b in an expression aob, then we 
call , 5 admissible. 

Note that the sequence Xl 1 , Xj. 2 ', ... is monotone, that is, X^' C xi* + for all 1 < k < m and 
all i > 1; this follows from the following basic fact (we omit the proof which is straightforward). 

Proposition 3. 7/ A, A', 2?, 2?' are species, and A C A', 25 C 25', i/ien ^1 + 25 C A' + 25' , 
A • 25 C A' • 25', and A o 25 C A' o 25'. 

Note that due to this proposition, it is easy to decide for a given recursive specification whether 
or not it is admissible (given also the information which of the species A\, . . . ,Ag contains struc- 
tures of size 0). 
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Definition 2.6. Let ^> be an admissible recursive specification with variables x\, . . . ,x m over 
the species A±, . . . and let the species xj 8 ' be as described above. If for each n, the set 
Ui>i ^''H is finite, then the species Xi, . . . , X m specified by '& are defined as follows. We set 
Xj[n] = Ui>i [ n ]- By monotonicity, for each n we have that Xj[n] equals X^^n] for some fc, 
and so we can define the transport of X^ by Xj [a] = [a] . 

Definition 2.7. Let 21 be a class of species. The class of species that is decomposable over 21 is 
the smallest class of species *B that contains 21, and that contains all species that can be defined 
by recursive specifications over species from < B. 

2.6. Decomposition of symmetries. In this subsection we provide a proof of Proposition [2j 
The proof relies on a precise description of the nature of the automorphisms for the basic species 
and for the species obtained from one of the constructions +, ■, or o. Even though proofs and 
details can already be found in [5] , we give our own presentation here since we build on this later 
on, in particular to define random generation rules (Section [5]). 

2.6.1. Automorphisms of some basic species. By convention, the neutral species 1 has the cycle- 
index sum 1 (the structure of size is assumed to be fixed by the "empty" automorphism, of 
weight 1). 

The cycle index sum of X is s\. The only X-structures are over U with \U\ = 1, and all 
automorphisms of such structures consist of a single cycle that has weight si. 

For the species Seq, the only automorphisms are the identity, and the identity has weight sf/fc!. 
As there are fc! sequences of length k, the cycle index sum of Seq'^ is s\. Since Seq = J2k>o Seq'^ , 
the cycle index sum for Seq is 



= £ s i' = T^- (io) 

f— ' 1 — Si 



■^Seq 

fc>0 

For the species Cyc, the automorphisms are exactly the 'shifts': for a cycle (v±, . . . , Wfc), the shift 
of this cycle by m £ [0..fc — 1] maps Vi to Vi +m where the indices are modulo k. The automorphisms 
consist of k/r cycles of length r, where r is the order of m in Z/(fcZ). For each divisor r of k, 
there are <f>(r) elements of order r in Z/(fcZ), where </>(.) is the Euler totient function. Hence, for 
each cycle of length fc, the sum of the weight-monomials over all the cycles of size r is <p(r)sr^ r /kl, 
and the sum of the weight-monomials over all the symmetries is J^rlfe 4 l (' r ) s r^ r /kl. As there are 
(fc — 1)! cycles of length fc, the cycle index sum of CYc' fc ' is 1/fc • J2r\k 4>{ r ) s V r ■ For the species 
Cyc = Ylk>i CYc' fe ' of all cycles, the sum of the weight-monomials over all symmetries of size r, 
which we denote by Zcrc,r, is thus 

*w = E - — E ls ™ = — 1<* ( t~ — 1 • (") 

* — ' fc r * — ' m r \ 1 — s r J 

k,r\k m>l v / 

Therefore, summing Zcyc,t over r > 1, one obtains the expression of Zqyc given in Figure [TJ 

For the species Set, the automorphisms arc all permutations. Hence, the cycle index sum for 
Set is simply the exponential generating series for all permutations, where each cycle of length i 
is marked by a variable Sj. In other words, when f(x) denotes the exponential generating scries 
for permutations, and fix; si, S2, S3, . . .) denotes the same generating series where each variable s, 
marks the number of cycles of length i, then the rules for computing exponential generating series 
yield 



\*>i 




I • (12) 

Hence, /(l; sx, S2, S3, • • •) is the cycle index sum for Set, and the cycle index sum for Set^' is 
[x k ]f(x; si, S2, S3, . . .), which corresponds to a restriction to permutations of size fc. Notice that 
Z SET [k] is always a polynomial; for instance, we have 

^SetI 2 ! = ^( s l+ s 2), ^SetI 3 ! = ^ («1 + 3s 2 Si + 2s 3 ) , Z Set [4] = ^(s^ + 6sf s 2 + 8S1S3 + 3s| + 6s 4 ). 
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2.6.2. Automorphisms related to the constructions. We now describe precisely how a symmetry of 
a species C = A A 23, with A € { + , •, o}, is assembled from symmetries on the species A and 23. 

Disjoint union. Clearly, each symmetry of C = A + 23 is either a symmetry of A or of 23 
depending on whether the underlying structure is in A or in 23. Consequently, Sym(j4 + 23) is the 
disjoint union of Sym(.A) and Sym(23). This directly yields the formula Z/t+s = Zj\ + Z%. 

Product. Consider a product species C = A • 23. Then there is the following bijective corre- 
spondence between symmetries of C and ordered pairs of a symmetry of A and a symmetry of 23: 
when ((A, B), a) is a symmetry of 6 where U are the atoms of A and V are the atoms of B, then 
((A, B),a) is in correspondence with the symmetry (A,a\u) of A and the symmetry (B,a\v) of 
23 (where a\u and a\y denote the restriction of a to U and V, respectively). 

Hence, Zq = Zji-Z-y, . (Indeed, for the species C the cycle index sum acts like an exponential gen- 
crating series for the species Sym(C) of symmetries when taking the refined weights sj 1 ■ • ■ 
instead of x n /nl.) 

Substitution. For two species A and 23 with 23 [0] = 0, let 6 = A o 23. To understand the 
automorphisms of C-structures, consider a C-structure C = (7r, A, (B p ) p ^) over [l..n]. Let a be 
an automorphism of C . It is clear that if two atoms v\ and v 2 of C are in the same component, 
then cr(wi) and o~(v 2 ) have to be on the same component as well, by definition of the transport for 
C; moreover, a induces an automorphism r of A. 

Consider an atom v from a component B p of C, and let k be the length of the cycle of r 
containing v. Note that o~ k maps B p to itself, so a k induces an automorphism on B p , the resulting 
symmetry being denoted by (B p ,o~ p ). Consider the cycle c = (pi, ■ ■ ■ ,Pk) m T where p\ = p. 
Observe that the symmetries (B Pi ,o~ Pi ), for i £ [k], can be seen as k copies of the same symmetry 
of 23, which we denote by (B c , a c ). For each cycle d = (wi, . . . , we) of cr c , let (d\, . . . , dk) be the 
copies of d at (pi, . . . ,Pk), respectively. Then one can merge di, . . . ,dk into a unique cycle of 
length I ■ k using a specific operation which we call composition of cycles. 

Definition 2.8. Let d = (vi, . . . , vt) be a cycle of atoms from [l..n], with v\ the atom of d having 
the smallest label. Let d\, . . . , dk be a sequence of k copies of the cycle. Then the composed cycle 
of di , . . . , dk is the cycle of atoms of length Ik such that, for 1 < i < k and 1 < j < I, the successor 
of the atom Vj in di is the atom Vj in d^+i; and for 1 < j < i, the successor of the atom Vj in dk 
is the atom fy+i) mod e in. di. 

This definition correctly reflects how each cycle of a is assembled from copies of cycles that are 
on isomorphic components. Indeed, walking k steps forward on the composed cycle corresponds 
to walking one step forward on one fixed cycle, which corresponds to the fact that the induced 
automorphism on each component B Vi is the effect of a iterated k times. 

In each symmetry (A,o~) where A is an .A-structure over [l..n], the automorphism a induces a 
partition of [l..n] corresponding to the cycles of a. We say that a has type t — (ci, . . . , c„) when 
Ci is the number of cycles of length i in a . Note that n = X)™=i ( an d such integer sequences 
(ci, . . . , c„) will also be called partition sequences (of order n)). 

To compute the cycle index sum Zji ^, we first choose a type t = (ci, . . . , c n ) of a permutation 
of [l..n]. The number a(t) of symmetries (A, a) of the species A where a is of type t equals 
n\ ■ [s^Sj 2 ■ ■ ■ s n™](^)- To obtain a symmetry (C, p) of A o 23 with C = (tt, A, B) where p induces 
on A a permutation a of type t, we choose a symmetry of A of type t, and then choose Ci symmetries 
of 23 for each i. The sum Z^oS °f ^ ne weight-monomials for all those symmetries is therefore 



Z aI-b = ~^T & i lfe 2 2 ' ' ' b n" > w here 6* := Z 3 {si,s 2 i, s$ 



)• 



(13) 



Summing over all possible types t of permutations, one obtains 

Zao-b = Z A (bi,b 2 ,b 3 ,..), where h := Z- B (s t , s 2i , s 3 . 



)• 



(14) 



3. Cycle-pointed Species 



In this section we introduce cycle-pointed species, and our unbiased pointing operator. 
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3.1. Cycle-Pointed Species. Let A be a species. Then the cycle-pointed species of A, denoted 
by A , is defined as follows. For a finite set U, the set .A°[L/] is defined to be the set of all 
pairs P = (A, c) where A G A[U] and c = (v±, . . . , v£) is a cycle of atoms of A such that there 
exists at least one automorphism of A having c as one of its cycles (i.e., (v\, . . . , vt) is mapped to 
(V2, ■ ■ ■ ,ve,vi)). The cycle c is called the marked cycle (or pointed cycle) of P, and A is called 
the underlying structure of P. Note that cycle-pointed species are in particular species. Thus, the 
theory from the previous section also applies to cycle-pointed species. 

An automorphism a of A having c as one of its cycles is called a c- automorphism of P, and the 
other cycles of a are called unmarked. By definition, two cycle-pointed structures P and P' are 
isomorphic if there exists an isomorphism from the underlying structure of P to the underlying 
structure of P' that maps the marked cycle of P to the marked cycle of P' (i.e., each atom of 
the marked cycle of P is mapped to an atom of the marked cycle of P', and the cyclic order is 
preserved) . 

Definition 3.1. A species CP is called cycle-pointed if CP C A , for some species A. For I > 1, CP(^) 
is the species that consists of those structures from CP whose marked cycle has length I. 

We define A® to be the subspecies of A° where in all structures the marked cycle has length 
greater than 1, i.e., A® = J2e>2(-^-°)w- cycle-pointed species that will be considered in the 
applications - except for maps - are either of the form A or A®.) 

Definition 3.2. Let CP be a cycle-pointed species. We define CP+ to be the species obtained from 
CP by removing the marked cycle from all CP-structures; that is, the set of CP^-structures on [l..n] is 
{A | (A, c) G CP n for some cycle c of atoms in A}. 

Clearly, for any species A we have that (A°y = A. 

3.2. Cycle Index Sums. In order to develop Polya theory for cycle-pointed species, we introduce 
the terminology of c-symmetry and rooted c-symmetry. Given a cycle-pointed species CP, a c- 
symmetry on CP is a pair (P, a) where P ~ (A, c) is a cycle-pointed structure in CP and a is a 
c-automorphism of A. A rooted c-symmetry is a triple (P, er, v) , where (P, a) is a c-symmetry, 
and v is one of the atoms of the marked cycle of P; this atom v is called the root of the rooted 
c-symmetry. The species of rooted c-symmetries of CP is denoted by RSym(CP). 

The weight-monomial of a rooted c-symmetry of size n is defined as 

1 " 

W(P,a,v) ■= — h~[[Si , (15) 

i=l 

where tt and the Sj's are formal variables, i is the length of the marked cycle, and for i 6 [n], nj(er) 
is the number of unmarked cycles of a of length i, i.e., rii(a) = Cj(cr) ii i ^ £ and n^(er) = ce(a) — 1. 
For simplicity, in the following we will write i%i for ni{o~) if the corresponding automorphism is 
clear from the context. We define the pointed cycle index sum Zy(s\, t\; S2, t%\ . . .) of CP, denoted 
by Zy, as the sum of the weight- monomials over all rooted c-symmctrics of CP, 

Z v (s 1 ,t 1 ;s 2 ,t 2 ;...):= w (P^,v)- (16) 

(P,a-,i>)eRSym(y) 

The following lemma is the counterpart of Lemma [1] for cycle-pointed species; it shows that 
pointed cycle index sums refine ordinary generating functions. Recall that for a species CP, the set 
CP denotes the unlabeled CP-structures, see Section |2"T2"1 

Lemma 4. Let CP be a cycle-pointed species. For n > 0, each unlabeled structure P € CP„ gives 
rise to exactly n\ rooted c-symmetries, i.e., there are nl rooted c-symmetries (P,a,v) such that 
P G P. As a consequence, 

T{x) = Z y (x,x;x 2 ,x 2 ;...). (17) 

Proof. Lemma [T] implies that P n gives rise to n! symmetries. Now we establish a bijection between 
these symmetries and rooted c-symmctries from P n . Fix a c-symmetry (Pq, cto) where Po is from P„ 
(by definition of cycle-pointed species, such a c-symmetry exists). Now, let (Pi, a\) be a symmetry 
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Figure 2. An unlabeled nonplane tree of size 4 yields 4 unlabeled cycle-pointed trees. 

of y where P\ is also from P n . The marked cycle (vi, . ■ . ,vi) of Pi is preserved by oi, so that all its 
elements are shifted by the same value r £ [l..£] modulo £, i.e., oi maps vi to Vn+r) mo( j e- Since 
both Po and P\ are from P n , there is an isomorphism <j from P\ to Pq. Moreover, the permutation 
t := (j~ uqg is a c-symmetry of Pi. Observe that the permutation T~ r+1 o\ is an automorphism 
of Pi that moves an atom of the marked cycle r steps forward (because of o~\) and then r — 1 
steps backward (because of r _r+1 ). Hence, t~ t+1 <ti is a c-symmctry of Pi. The desired bijcction 
maps (Pi,cti) to the rooted c-symmctry (Pi,r — r+ cri,v) where v is the atom of the marked cycle 
having the r-th smallest label. This correspondence can be inverted easily, and hence we have 
found a bijcction between the symmetries and the rooted c-symmetrics for structures having P n 
as unlabeled structure. □ 

Observe that a rooted c-symmetry of A is obtained from a symmetry (A, a) of A by choosing 
an atom v of A and marking the cycle of a containing v. Therefore, each symmetry (^4, a) of size 
n of the species A yields n rooted c-symmctrics on A , and hence 

Z A o(s 1 ,t 1 ;s 2 ,t 2 ; ■ ■ -)l{ti=si,t2=s2,...} = 4r z A(tsi,t 2 s 2 ,t 3 s 3 , . . .)|{ t= i}. (18) 

Further, a rooted c-symmetry of A° can equivalently be obtained by marking a cycle of atoms that 
corresponds to a cycle of a and choosing an atom of the cycle as the root of the rooted c-symmctry. 
Together, these observations yield the equality 

& 

Z A ^(sx,tx;s 2 ,t 2 -,...)\{ tl=SlM=s ^...- i =£t i —Z A (s 1 ,s 2 ,...) for all I > 1. (19) 

For I = 1, which corresponds to structures of A with a unique distinguished vertex (the root), we 
recover the well-known equation relating the cycle index sum of a species and of the associated 
rooted species; see [2j Sec. 1.4.] and [16]. 

As stated below and illustrated in Figure [21 pointing a cycle of symmetry instead of a single 
atom (as the classical pointing operator does) yields an unbiased pointing operator in the unlabeled 
setting. 

Theorem 5 (unbiased pointing). Let A be a species. Then, for n > 0, each unlabeled structure of 
A n gives rise to exactly n unlabeled structures in A° n ; that is, for each A <G A[n] there are exactly 
n non-isomorphic structures (A, c) in A° [n] . Hence, the ordinary generating series of A° satisfies 

X°(x) = x— A(x). (20) 
dx 

Proof. Given A € A n , let S be the set of unlabeled pointed structures of A° whose underlying 
unpointed structure is A. The proof of the lemma reduces to proving that S has cardinality n. 
Let Sym(A) be the set of symmetries for the structure A, and let RSym(S') be the set of rooted 
c-symmetries for structures from S. Lemma[T]shows that A has n\ symmetries and Lemma|4]shows 
that each structure of A„ has n! rooted c-symmetries. Hence |Sym(^4)| = n\ and |RSym(5')| = 
IS'ln!. In addition, wc have seen that a symmetry (A, a) of size n on A has n rooted c-symmetrics 
on A . Hence, |RSym(S')| = n|Sym(^4)|. Thus, we obtain IS'ln! = nnl, and therefore IS"! = n. □ 

Remark 3.1. Theorem [5] is equivalent to a result known as Parker's lemma Section 2.8]. For a 
subgroup G of the symmetric group ©„, say that a cycle c in g G G is equivalent to a cycle c' in 
g 1 € G if there exists h g G that maps the elements of c to the elements of c' and preserves the 
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Basic cycle-pointed species: A° , A®, 

where A e {0, 1,X, SEQ [fc] , SET [k] , CYC [fc] , Seq, Set, Cyc} 
Pointed cycle-index sum: 

Za° = ^ I t&-p. — Za 

e>i e 

Zj\% = y^i t(—z A 

e>2 Se 


Construction 


Notation 


Pointed cycle index sum 


Pointed union 
Pointed product 
Pointed substitution 


ft = ft + Q 
ft = ft*23 
ft = ft® s 


Zjj = + 
Zjl = Zy • Z% 
Zy_ = Zy © Z's, 



Figure 3. The rules to calculate the pointed cycle index sums of basic cycle- 
pointed species and of cycle-pointed species composed by +, *, and ©. 

cyclic order, i.e., for each element x € c, the successor of x in c is mapped by h to the successor of 
h{x) in c! . Let be the number of inequivalcnt cycles of length k. Then Parker's lemma states 
that J2l=i a k = n - 

If this lemma is applied to the automorphism group of a fixed structure A of size n, then is 
the number of unlabeled cycle-pointed structures arising from A and such that the marked cycle 
has length k. So Parker's lemma states that there are n unlabeled cycle-pointed structures arising 
from A, i.e., it implies Theorem [S] (conversely, each permutation group is the automorphism group 
of a structure, so Parker's lemma can be deduced from Theorem [S]). 

Remark 3.2. The classical pointing operator, which selects a single atom in a structure, yields an 
equation similar to (|20[) for exponential generating series, which is useful for labeled enumeration. 
Given a species A, let A' be the species of structures from A where an atom is distinguished. 
Then 

A'(x) = x^-A(x). (21) 
ax 

An important contribution of this article is to define a pointing operator A i— > A° that yields the 
same equation, Equation (|20[) . in the unlabeled case. 

3.3. Basic Cycle-pointed Species and Constructions. 

3.3.1. Basic cycle-pointed species. Species of the form A° or A® where A is a basic species as 
introduced in Subsection 12.41 will be called basic cycle- pointed species. The derivation rule in 
Equation (|19|) allows us to compute the pointed cycle index sum of basic cycle-pointed species, as 
indicated in Figure [3] (upper part). 

3.3.2. Pointed constructions. It is clear that the disjoint union of two cycle-pointed species as 
defined in Section 12.4.21 is again a cycle-pointed species. We now adapt the constructions of 
product and substitution to obtain a pointed product and a pointed substitution operation that 
produce cycle-pointed species. 

Pointed product. Let A, 23 be species, and let ft C A° be a cycle-pointed species. Then the 
pointed product ft* 25 of ft and 23 is the subspecies of (A • 23)° of all those structures ((A,B),c) 
in (A • 23)° where the pointed cycle c is from A, and (A, c) € ft. 

Pointed substitution. Let E = ((tt, A, (i? p ) p67r ), c) be a structure in (A o 23)°. The study of 
automorphisms of structures in A o 23 performed in Section 12.6.21 shows that 

• c is the cycle composed from cycles ci, . . . , Ck on different components B P1 , . . . , B Pk ; 

• the structure (A, d = (pi, ...,pk)) is cycle-pointed. 
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FIGURE 4. A cycle-pointed graph obtained from a substitution. 



We call (A, c') € A° the core- structure of E. 

We can now define a substitution construction for cycle-pointed species. Let 7 C A° and let 
23 be a species such that 23 [0] = 0. Then 7 © 23 is defined as the subspecies of structures from 
(.A o 23)° whose core-structure is in 7. 

As in the previous section, we make the convention that in expressions involving several of the 
symbols +,•,*, o,©, the symbols o, © bind stronger than the symbols and the symbols 
bind stronger than the symbol +. 

In a similar way to the labeled framework [2] , our pointing operator behaves well with the three 
constructions +, •, and o: 

Proposition 6. The cycle-pointing operator obeys the following rules: 



Proof. It is easy to see that (A + 23)° is isomorphic to A° +23°. For the product, note that 
the pointed cycle c of a structures from (A • 23)° has to be entirely on A or entirely on B. The 
species that contains all structures ((A, B), c) where c is on A is isomorphic to A° ★ 23, and the 
species that contains all structures ((A,B),c) where c is on B is isomorphic to 23° * A. Hence, 
(A • 23)° = A° *23 + 23° *A. For the expression for the substitution operation, we in fact have not 
only isomorphism, but even equality of species. This is clear from the fact that all core-structures 
of structures from (A o 23)° are from A° . □ 

As in the unpointed case, there are explicit rules to compute the pointed cycle index sums 
for each basic species and for each construction. To this end we need the following notion of 
composition for power series. 

Definition 3.3. Let / and g be two power series of the form / := f(xi,yi;x2,y2',---) and 
g := g{x\, x%, ■ ■ •) such that g(0, 0, . . .) = 0. Then the pointed plethystic composition of / with g is 
the power series 



The following proposition is the counterpart of Proposition [2] for cycle-pointed species. 

Proposition 7 (computation rules for pointed cycle index sums). For each basic species A, the 
pointed cycle index sum of the pointed species A° and A® is given by the explicit expression given 
in Figured ( upper part) in terms of the cycle index sum of A. For each of the fundamental pointed 
constructions +, *, and © there is an explicit rule, given in Figure^ (lower part), to compute the 
pointed cycle index sum of the resulting species. 



(A + 'B) 
(A ■ 23)° 
(Ao<B)° 



A° + 23°, 

,A°*23 + B°*A 
■A°@23 . 



(22) 
(23) 
(24) 



f©9 ■= f(9i,hx;g2,h 2 ;...), 
with g k = g{x k ,x 2 k,x 3k , . . .) and h k = h(x k ,y k ;x 2k ,y2k; ■ ■ •) for h := J2e>i ^i^9- 



(25) 
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Proof. The statement is clear for the cycle-pointed atomic species and the pointed union. Let A, 23 
be species, and let CP C A° be a cycle-pointed species. For the cycle-pointed product Q = CP * 23 
notice that, similarly as for a partitional product for species, a rooted c-symmetry on Q decomposes 
into a rooted c-symmetry on CP and a symmetry on 23, since the automorphism has to act separately 
on the two component structures. Therefore, RSym(Q) can be considered as a partitional product 
of RSym(CP) and Sym(23), which yields Z Q = Zy • Z*. 

For the substitution construction, the proof is similar. Let CP C A° be a cycle-pointed species 
and let 23 be a species such that 23 [0] = 0. Let Q = CP © 23 C (A o 23)°. Consider a rooted 
c-symmetry (Q,cr, v). As we have seen in Section 12.61 the core structure A is endowed with an 
induced automorphism a' . In addition, the automorphism is naturally rooted at the atom v' G A 
where the 23°-component that contains the root v is substituted. We denote the cycle of a' that 
contains v' by c. Now we have that (A,c') is cycle-pointed and the automorphism a' rooted 
at v 1 is a rooted c-symmetry on P = (A,cJ). In addition, the components substituted at each 
atom of a cycle c = (ui, ...,Uk) of a' arc isomorphic copies of a same symmetry on 23. The 
components that are substituted at the atoms of the marked cycle c' are naturally rooted at the 
isomorphic rcprescntant of v. Finally, this decomposition is rcvcrsable: one can go back to the 
original composed symmetry using the composition of cycle operation. 

To express these observations in an equation, we define the type of the rooted c-symmetry 
(P, p, v) with P = (A, c') to be the sequence (£; ni,n 2 , . . . , n^) where £ is the length of c' , and is 
the number of unmarked cycles of length i in p. Note that the size of P is £ + ^ in,. The number 
a(t) of rooted c-symmetries of type t = (£; ni, n 2 , ■ • • , n n ) on CP is nl[t s™ 1 • • • s™"]Zy. The core 
type of a rooted c-symmetry on CP © 23 is defined as the type of the rooted c-symmetry induced on 
the core structure. 

Let Zj^ be the pointed cycle index sum of CP restricted to the rooted c-symmetries with core-type 
t = (£; m, 77.2, ■ • ■ , n n ). From the above discussion, we have 

Z% = —fqeb^b^ 2 where 6, := Z s (si, s 2 , . . .), qt := (si, h\ s 2 e, tui ■ ■ ■)■ (26) 

Summing over all possible types of rooted c-symmetries t, we obtain 

Z% = Zp(bi,qi; 62,92; •• .)> where bi := Z%(s 1 ,s 2 , ■■ .), q& := Z%o (s e ,tf, s 2 e, he; ■ ■ •)• 1=1 

In the following we introduce recursive specifications that involve pointed constructions. Cycle- 
pointed specifications are like standard recursive specifications (Definition [53]), but with two sorts 
of variables (where one is reserved for cycle-pointed species) and where we are allowed to use 
additionally the pointed constructions. 

Definition 3.4. A recursive cycle-pointed specification with variables x±, . . . , x m , y\,..., y m ' over 
the species A\, ... ,Ai and over cycle-pointed species 23 1, . . . , 23 ^ is a system ^ of equations x\ = 
ei, . . . , x m = e m , yi = /1, . . . , y m > = f m > where each e l is 

• of the form a + b or a • b with a, & £ {xi, . . . , x m ,Ai, . . . , A£\, or 

• of the form aob with a G {Ai, . . . ,Ai} and b € {xi, . . . , x m ,Ai, . . . ,A{\, 

and each /j is 

• of the form a + b with a, b £ {y\, . . . , y m > , 23i, . . . , 23^}, or 

• of the form a-kb with a £ {yi, . . . , y m >, CBi, ...,23^} and b <S {x\, . . . , x m ,Ai, . . . ,Ag], or 

• of the form a©b with a £ {23i, . . . , 23^} and b £ {xi, . . . , x m , A\, . . . ,Ai}. 

To define the species Xi, . . . , X m , Vi, . . . , y m ' that are given by a recursive cycle-pointed spec- 
ification ^/ with variables x\, . . . , x m , yi, . . . , y m > over the species A±, . . . ,Ae, 23i, . . . , 23^ where 
23i, . . . , 23fe are pointed, we again (as in Section 123)) consider sequences of species and ty- for 

i > 1. For i = 1, we define Xf = for all 1 < j < m, and = 0° for all 1 < j < rri . For i> 1 
the species X^ and are obtained by evaluating the corresponding expressions for Xj and yj , 
respectively (as in Section l2.5p . We say that VP is admissible if in expressions of the form a o b or 
a © b the species substituted for b never contain structures of size 0. 
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Note that also the new pointed constructions are monotone, so in case that for each n the sets 
Ui>i ^ [ n ] an£ i Ui>i^ [ n ] are finite it is straightforward (and analogous to Definition 12.61) to 
define the species Xi, . . . , X m , ^i, . . . , specified by admissible recursive specifications ^ over 
A\, . . . ,Ag, Si, . . . , 

Definition 3.5. Let 21 be a class of species. The class of species that is cycle-pointing decomposable 
over 21 is the smallest class of species 58 that contains 21, contains all species that can be specified 
by cycle-pointed recursive specifications over pointed and unpointed species from <8, and contains 
all species obtained from species of the form A in 03 by applying the unpointing operation. 

Plenty of examples of species that are decomposable over simple basic species can be found in 
Section |U 

Proposition 8. If a species A is decomposable (in the sense of Definition \2.7\ ! , then the pointed 
species A° is cycle-pointing-decomposable. 

Proof. Follows directly from Proposition |H1 □ 

Remark 3.3. The ordinary generating series inherit simple computation rules from the ones for 
pointed cycle index sums. As expected, for the sum and product constructions, one gets 

R = J> + Q =s> $.(x) = V(x) + Q(x), (27) 

Q = 7 * T> => Q(x) = ?{x) ■ %(x). (28) 

For the substitution construction, the computation rule is: 

Q = y@'B => Q(x) = ZyCB(.T),^(x);5(x 2 ),^(x 2 );...), (29) 

where H°(x) = x4--T>(x). Hence, to compute the ordinary generating series of a decomposable 
cycle-pointed species, the only place where the cycle index sum or pointed cycle index sum is 
needed (as a refinement of ordinary generating series) is for the species that is the first argument 
of a substitution or pointed substitution construction. 

Remark 3.4. As an exercise, the reader can check just by standard algebraic manipulations that the 
computation rules for cycle- index sums are consistent with Proposition [5] For instance, proving 
Z(Ao r B)° = is equivalent (by the computation rules) to proving the equality ^(yioS)° = 

Zj\o © Zs, which reduces to checking the following identity on power series: 

A(fag) = (Af)®g, (30) 

where A is the operator that associates to a power series f(x\, X2, X3, . . .) the power series 

Q 

Af(xi,yx;x 2 ,y2\ ...):= 2_,l yi-g—f(xi,x 2 ,x s ,-- •)• 

Similarly, to prove ^(yi+3)° = -^yi o +s o an d = ^A°*cB+s°*yi, one has to check the identities 

A(/ + g) = Af + Ag, and A(/ • g) = Af ■ g + f ■ Ag, respectively. 

4. Application to Enumeration 

In this section wc demonstrate that cycle-pointing provides a new way of counting many classes 
of combinatorial structures in the unlabeled setting. Typically, species satisfying a "tree-like" 
decomposition are amenable to our method. This includes of course species of trees, but also 
species of graphs (provided that the species is closed under taking 2-connected components, and 
that the sub-species of 2-connected graphs is tractable), and species of planar maps. 

The general scheme to enumerate unlabeled structures of a species A, i.e., to obtain the coef- 
ficients \A n \, is as follows. First, we observe that the task is equivalent to the task to enumerate 
unlabeled cycle-pointed structures from A , because \A„ \ = n\A n \. Enumeration for A° turns out 
to be easier since the marked cycle usually provides a starting point for a recursive decomposition. 
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For a cycle-pointed species of trees (and more generally for species satisfying tree-like decom- 
positions), the first step of the decomposition scheme is to distinguish whether the marked cycle 
has length 1 or greater than 1. The general equation is 

A° = X°*A'+A®, (31) 

where A' is the derived species of A, consisting of structures from A where one atom is marked 
with a special label, say *, as defined in [2~pl . Then each of the two species A' and A® has to be 
decomposed. For derived structures (the species A') we follow the classical root decomposition. 
For symmetric cycle-pointed structures (the species A®) our decomposition strategy is different, 
and leads us to introduce the notion of center of symmetry. 

4.1. Trees. We first illustrate our decomposition method for trees, which are defined as connected 
acyclic graphs (i.e., unless mentioned otherwise, trees are unrooted), and we start with the formal 
definition of the center of symmetry. Let T be a symmetric cycle-pointed tree. A path of T 
connecting two consecutive atoms of the marked cycle is called a connecting path (thus the number 
of connecting paths is the size of the marked cycle) . 

Claim 9 (center of symmetry) . Given a symmetric cycle-pointed tree T, all connecting paths of T 
share the same middle v c , called the central point for the marked cycle ofT. The central point v c 
is the middle of an edge e if these paths have odd length, and is a vertex v if these paths have even 
length. In the first (second) case, the edge e (the vertex v, resp.) is called the center of symmetry 
ofT. 

Proof. We prove here that all connecting paths share the same middle. Let U be the subgraph of 
T formed by the union of all connecting paths. Observe that U is connected, so U is a subtree of 
T. In addition, U is globally fixed by any c- automorphism of T (indeed, the property of being on 
a connecting path is invariant under the action of a c-automorphism) , and it contains the atoms 
of the marked cycle of T. Hence U is the underlying structure of a cycle-pointed tree (U,c). 
Consider the classical center of U , obtained by pruning the leaves (at each step, all leaves are 
simultaneously deleted) until the resulting tree is reduced to an edge or a vertex [2J. The central 
point v c of U is defined as follows: if the center of U is a vertex v, then v c := v, if the center of 
U is an edge e, then v c is the middle of e. Let a be a c-automorphism of (U, c) and let (a) be 
the group of automorphisms generated by a. It is well known that the central point is fixed by 
any automorphism on the tree, hence v c is equidistant from all atoms of the marked cycle, as the 
group (a) acts transitively on the vertices of the marked cycle. In addition, v c is on at least one 
connecting path of T (because U is the union of these connecting paths). Hence, v c has to be on 
all connecting paths, as the group (a) acts transitively on the connecting paths. Thus, v c has to 
be the middle of all connecting paths simultaneously. □ 

Remark 4.1. Notice that the center of symmetry might not coincide with the classical center of 
the tree, as shown in Figure [5j However, in the case of plane trees, the two notions of center 
coincide. 

4.1.1. Nonplane trees. Let 5" be the species of free trees, i.e., unrooted nonplane trees (equivalently, 
acyclic connected graphs), where the vertices are taken as atoms. Let 3 r> be the derived species 
of 2r (also the species of derived nonplane trees). Rooted nonplane trees can be decomposed at 
the root. Since the root does not count as an atom and since the children of the root node are 
unordered, we classically have 

J' = Set o (X • J'). (32) 

In contrast, the decomposition of symmetric cycle-pointed trees docs not start at atoms of the 
marked cycle, but at the center of symmetry, which is either an edge or a vertex (see Figure [5] for 
an illustration of the decomposition). In order to write down the decomposition, we introduce the 

-^Note that the derived species A' is not cycle-pointed. However, it can be identified with A^y Indeed, by 
adding a new label and a pointing loop on the marked atom, one obtains a bijective correspondence between 
and X° *A'. 
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Figure 5. Decomposition of a nonplane tree at its center of symmetry (in case 
the center of symmetry is a vertex). 



species L consisting of a single one-edge graph. Note that L ~ Set' 2 ', and that £® consists of the 
link graph carrying a marked cycle of length 2 that exchanges the two extremities of the edge. 

Claim 10. The species 5"® of symmetric cycle-pointed free trees satisfies 



where ft := X • 3" is the species of all pointed trees. 

Proof. Consider a tree produced from the species L® © ft + X • Set® © ft (for an example of a 
tree produced from X • Set® © ft, see the transition between the right and the left drawing in 
Figure [5]). Clearly, such a tree is free and cycle-pointed and it is symmetric because the marked 
cycle of the core-structure - an edge e in the first case, a cycle-pointed set attached to a vertex v 
in the second case - already has length greater than 1. Hence 5F® D L® ©ft + X • Set® © ft . 
Notice also that in the first (second) case, e (v, respectively) is the center of symmetry of the 
resulting tree. Indeed each connecting path connects vertices on two different subtrees attached 
at the center of symmetry, which, by symmetry, stands in the middle of such a path. 

Conversely, for each symmetric cycle-pointed free tree T, we color blue its center of symme- 
try, which plays the role of a core- structure for T. Partition J® as 3® + J®, where 3"® (J®, 
respectively) gathers the trees in 3® whose center of symmetry is a vertex (an edge, respectively). 
Define also M v (M e ) as the species of free trees with a distinguished vertex (edge, resp.) that 
is colored blue. Clearly M v = X • (Set o ft) and M c = L o ft. From Proposition [51 we obtain 
M® = X° * (Set q ft) + X • Set° © ft and M° = L° © ft. Observe that 3f (3f) contains the 
structures of M® (of M®) where the blue vertex (edge, resp.) is the center of symmetry. It is 
clear that the structures of X° * (Set o 3V) have their marked cycle of length 1, so they are not in 
5"®. Concerning the structures of X • Set°^ © 3L, the atoms of the marked cycle are on a same 
subtree attached at the blue vertex, so that the blue vertex is not the center of symmetry. Hence 
the structures of X • Set"^ © 31 are not in 9\®. Similarly, the structures of © 31 arc not in 

Therefore we obtain the second inclusion J® C £® © 31 + X • Set® © 31. □ 

Proposition 11 (decomposing and counting free trees). The species 3^° of cycle-pointed free trees 
has the following cycle-pointed recursive specification over the species Set, L® , X : 



J® = £® © 31 + X ■ Set® © 31, 



(33) 




X°*?' + J®, 

Set oft, 31 = X -J', 

L® © 3L + X ■ Set® ©ft, 

X° * (Set o ft) + X • Set © ft, 



(34) 
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The ordinary generating junction f{x) := of free trees satisfies the equations 

xf'{x) = r(x)+x 2 r'{x 2 )+ ^J2x e r'(x e )^jr(x) (35) 

= xr'(x)(l -r(x)) + x 2 r'(x 2 ), (36) 
1 

i>l 



where r(x) is specified by r(x) = .xcxp | — r(x l ) 



Proof. The first three lines of the grammar are Equations (|3Tj) . (|32|) . and ([33]) . respectively. The 
fourth lincQ is obtained from the second line (i.e., Ji = X • (Set o Ji)) using the derivation rules of 
Proposition [S] 

Concerning the OGSs, let r(x) := $,(x) be the OGS of the species Ji. Note that $°(x) = xf'(x) 
and 31° (x) = xr'(x) by Theorem [5] By the computation rules for OGSs (Remark 12.11 and Re- 
mark l3.3j ). the second line of the grammar, i.e., Ji = A-(SEToJi), yields r(x) = xcxp(J2i>i r {x l )/i); 
and the third line of the grammar yields 

3®{x) = Z LS> (r(x), xr'(x); r{x 2 ), x 2 r'{x 2 ); . . .) + xZ Set ® (r(x), xr'(x); r(x 2 ), x 2 r'{x 2 ); . . .). 

Applying the derivation rule ([TO]) to L = Set' 2 ' and Set, we get the expressions Z £® = t^ and 
^Set® = (E<!>2*i) • z Set- Hence 3®{x) = x 2 r'(x 2 ) + ( E^> 2 ^'O^ )) ' r ( x )- Finally, the first 
line of the grammar yields £F°(a;) = r(x) + 3®{x), which gives Expression (|35|) of xf'(x). Using 
xr'ix) = r(x)(l + Yle>i x e r'(x £ )), this expression simplifies to Expression ([36)) of xf'(x). □ 

Remark 4.2. The expression xf'(x) = xr'(x)(l — r(x)) + x 2 r'(x 2 ) clearly agrees with Otter's 
formula [25] : 

f{x) = r{x)-\{r 2 {x)-r(x 2 )), (37) 

which can be obtained either from Otter's dissimilarity equation or from the dissymmetry theo- 
rem [2J. The new result of our method is to yield an expression for xf'(x) - Equation ([33]) - that 
has only positive signs, as it reflects a positive decomposition grammar. This is crucial to obtain 
random generators without rejection in Section [5] 

All the arguments we have used for free trees can be adapted to decompose and enumerate 
species 3n of trees where the degrees of the vertices lie in a finite integer set CI that contains 1. 
It is helpful to define the auxiliary species Jin that consists of trees from GFq rooted at a leaf that 
does not count as an atom. By decomposing trees at the root, we note that Jin has the recursive 
specification 

Jin = X • SET n _i o Ji a , with SETn-i := U^Set 1 ^ 11 . 
The species Jin serves as elementary rooted species to express the pointed species arising from 3n- 

Proposition 12 (decomposing and counting degree-constrained trees). For any finite set CI of 
positive integers containing 1, let 3n be the species of nonplane trees whose vertex degrees are in 
CI. Then the species 3^ has the following cycle-pointed recursive specification, where SETn := 
U fcG jiSET [fc] andSETn-i := U fce nSET [fc - 1] ; 

qro vo .or / i qr© 

9h' - Set oJJ , Ji n = X • SETn_i o K n , 
J® = L®@ Ji n + X- Set® ® Jin, 
Ji = A *Set o Jin+A-SET^® Ji n . 



2 The fourth line of Equation < 1 34 D is not needed for enumeration, but it is necessary to make the grammar 
completely recursive, and, as such, will be necessary for writing down a random generator in Section [5] 
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The ordinary generating function fn(x) := 3^2 (^) satisfies the equation 

x fn( x ) = xZ SETn (r n (x),ra(x 2 ), . . .) + x 2 r n (x 2 ) + xZ Set@ (m(x), xr' n (x); r a (x 2 ), x 2 r' n (x 2 ); . . .), 

(39) 

where r n (x) is specified by rn(x) = x • Zseto-i ( r n(x), rn(x 2 ), rn(x 3 ), . . .). The power series 
^SETn-D ^SET f j and Z Set ® appearing in the equation are polynomials that can be computed ex- 
plicitly. 

Example 1. Unrooted nonplane binary trees. Trees whose vertex degrees are in := {1,3} are called 
unrooted nonplane binary trees (note that rooting such a tree at a leaf, one obtains a rooted nonplane 
binary tree, i.e., each internal node has two unordered children). In that case, the elementary cycle index 
sums required in Equation (|39[) are 

#SET n _i = 1 + ^S? + |s2, Z SETn = Si + -si + ^«1S2 + ^S3, ^Set® = * 2Sl + * 3 - 

Let f(x) be the OGS of unrooted nonplane binary trees and r(x) the OGS of rooted nonplane binary trees 
(rooted at a leaf that does not count as an atom). Firstly, from the expression of Zsm n _ 1 one obtains 



r{x) = x • (1 + -r{x) 2 + -r{x 2 )). 

Then, Equation fl35]) yields 

xf' (x) = x ■ (r(x) + Q r (x) 3 + -r(x)r(x 2 ) + — r(x 3 )) + x 2 r'(x 2 ) + x ■ (x 2 r (x 2 )r(x) + x 3 r'(x 3 )), 

i.e., 

f'(x) = r(x) + -r(x) 3 + —r(x)r(x 2 ) + -r(x 3 ) + xr' (x 2 ) ■ (1 + xr(x)) + x 3 r'(x 3 ). 

From this equation one can extract the counting coefficients of unrooted nonplane binary trees with respect 
to the number of vertices (after extracting first the coefficients of r(x)): 

xf'(x) = 2 • 1 • x 2 + 4 • 1 • x 4 + 6 ■ 1 ■ x 6 + 8 ■ 1 ■ a; 8 + 10 • 2 • x 10 + 12 ■ 2 ■ x 12 + 14 • 4 • x 14 + 16 • 6 • x 16 + . . . 

Hence the first counting coefficients with respect to the number of internal nodes (starting with internal 
nodes) are 1, 1, 1, 1, 2, 2, 4, 6. Pushing further one gets 1, 1, 1, 1, 2, 2, 4, 6, 11, 18, 37, 66, 135, 265, 552, 
1132, which coincides with Sequence A000672 in [32] (the number of trivalent trees with n nodes). . . □ 

4.1.2. Plane trees. A plane tree is a tree endowed with an explicit embedding in the plane. Hence, 
a plane tree is a tree where the cyclic order around each vertex matters. Let £ be the species of 
plane trees, where again the atoms are the vertices. As usual the startegy to count plane trees is 
to decompose £°, distinguishing whether the marked cycle has length 1 or larger than 1: 

£°=X°*£' + £®. (40) 

The species £' is decomposed with the help of another species of plane trees: denote by A the 
species of plane trees rooted at a leaf which does not count as an atom. Decomposing A at the 
root, we get 

A = X-SmoA. (41) 
Again the species A serves as elementary rooted species to express species of pointed plane trees: 

Proposition 13 (decomposing and counting plane trees). The species £° of cycle-pointed plane 
trees has the following cycle-pointed recursive specification. 

£° = X°*£' + £®, 

£' = CycoA, A = X-SmoA, 

(42) 

£© = L® ®A + X -Cyc® ©A, v ' 

A° = X° * Seq oA + X • Seq° ® A, 

The ordinary generating function £(x) of plane trees satisfies the equation: 

e'W = 1 + E^^TT^+-V) + E*W^. (43) 
e>i v ' e>2 v ; 
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where a(x) is the series of Catalan numbers: a(x) = — ( 1 — y/1 — Ax ) = — — - 



n>0 



2n~ 



By coefficient extraction, one gets the following formula for the number e„ of plane trees with 
11 + 1 vertices (entry AO 0299 5 in [52] ): 



1 



1 fin 
2n\ n 



1 



In 



odd \ 
■ 2n'+l J 



2n' 



(44) 



Proof. The grammar is obtained by arguments similar to those used to derive the grammar (|34j) 
for free trees. The only difference is that the cyclic order of the neighbors around each vertex 
matters, so a Set construction in the grammar for free trees typically has to be replaced by a 
Cyc construction in the grammar for plane trees. □ 

All the arguments apply similarly for species of plane trees where the degrees of vertices are 
constrained. As a counterpart to Proposition [TT] we obtain: 

Proposition 14 (decomposing and counting degree-constrained plane trees). For any finite set 
f2 of positive integers containing I, let £n be the species of free trees where the degrees of vertices 
are constrained to lie in tt. Then the cycle-pointed species Eq is decomposable, it satisfies the 
following decomposition grammar, where CYCn '■= Ufc e nCYc' fc ' and Seq^_ x := UfcgnSEQ^ -1 ' : 



CO 

c n — 



En' = CYCnoAn, A n = X • Seq q _ x o A n , 
£® = £® ®A a + X -Cyc® ®A n , 
A° n = X°*SEQ n _ 1 o./ln + X-SEQ n _ 1 ®.An. 

The ordinary generating function eci(x) :— Eq(x) satisfies the equatioi^: 

P{r la n (x r ) k/r + xa n '(x 2 ) + ]T ${r)x r a n ' {x T )a a {x r ) k ' r -\ 



(45) 



en '{x) 



ken 

r\k 



(46) 



feen 

r|fc,r>l 



where a$i{x) is specified by o,q(x) = x an(x) 

ken 



k — l 



Example 2. d-regular plane trees. For d > 3, a d-regular plane tree is a plane tree such that each internal 
node has degree d, which corresponds to the case Q — {1, d} in Proposition 1 141 It is easily shown that such 
a tree with n internal nodes has m = n(d — 2) + 2 leaves. Let Eu] be the species of d-regular plane trees, 
where the atoms are the leaves (it proves here more convenient to take leaves as atoms and to write the 
counting coefficients according to the number of internal vertices). Let A\j\ := £.[d{ be the corresponding 
derived species, which satisfies A[d] = X + (A\d]) ■ The decomposition for the cycle-pointed species E[ d ° 
is 

£ M ° = X°*A [d] +£ m @A [d] + (Cvc ld] f ®A [d] . 
Hence, the OGS &\d](x) := Eu](x) satisfies: 

xe[ d {(x) = xa [d] (x) + x 2 a [d {(x 2 ) + ^ <f)(r)x r a [d {(x r )a [d] (x r ) d/r ~ 1 , where a [d ](x) = x + a [d] (a;) d_1 . 

r | d,r> 1 

The coefficients of each of the summand series (such as a[d](x)a[ d ]{x) k ~ 1 ) have a closed formula, which can 
be found for instance using the univariate Lagrange inversion formula. From these formulas, we obtain 
the following expression for the number e n [ d ] of d-regular plane trees with n internal nodes: 



°n,[d] 



1 /n+m-2 
m — 1 \ n 



/n +m 2 -l 1 

1 % • „/ + 

\(d-2)|(m-2)/2/ V U 



r > 1 ,r j d,7' | m 
(d-2)|(m-d)/r 



7l r +77l r — 1 



(47) 



^To obtain this equation wc use the formula Zq^iq = — 0(0- 



k/r 
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where 

, , „\ / m — 2 n m — d n — 1 , 

m = n(d-2) + 2, n^^-^^-, nr =_^_ = __ 1 m r = m/r. 

One can extend this formula to any degree distribution on vertices, by adding variables marking the degree 
of each vertex and applying the multivariate Lagrange inversion formula. A general enumeration formula 
is given in [6] using the dissymmetry theorem □ 

Remark 4.3. As the only automorphisms for plane trees are rotations, a simple application of 
Burnside's lemma is enough to get f|47[) . (An adaptation of Burnside's lemma to unrooted plane 
graphs is given in |22|.) In contrast, free trees require more involved counting techniques using 
cycle index sums. Currently, these techniques are Otter's dissimilarity equation, the dissymmetry 
theorem (these two methods being closely related), and now cycle-pointing. 




FIGURE 6. (a) A connected (outerplanar) graph, (b) its block-decomposition, (c) 
the associated Bv-tree. 



4.2. Graphs. We extend here the decomposition principles which we have developed for trees 
to the more general case of a species of connected graphs, by taking advantage of a well-known 
"tree-like" decomposition of a connected graph into 2-connected components. (A 2-connected 
graph is a graph that has at least two vertices and has no separating vertex.) Given a connected 
graph G, a maximal 2-connected subgraph of G is called a block of G. The set of vertices of G is 
denoted 93(G) and its set of blocks is denoted 25(G). The Bv-tree of G is the bicolored graph with 
vertex-set 93(G) U 93(G) and edges corresponding to the adjacencies between the blocks and the 
vertices of G, see Figure It can be shown that the Bv-tree of G is indeed a tree, see [THl p. 10] 
and [M] for details. 

Proposition 15. Let 3 be a species of connected graphs that satisfy the following stability property: 
"a connected graph is in 3 iff all its blocks are in 3 "■ Let 25 be the subspecies of graphs in 3 that are 
2-connected. Then 3 admits a decomposition grammar from the species of 2-connected structures 
25', 25®, and (25')°: 

{s° = x-s' + s®. 
3' = Set o DC, DC = 25' o J£, J£ = X • S', 
g© = 25® © •K + X • Set® © DC, ^ 
D{° = X° * Set o DC + X • Set © DC, DC° = (25')° © Df . 

Hence, if the species of 2-connected structures 25® and 25' are decomposable (the latter implies that 
(25')° is decomposable), then the cycle-pointed species 3° is decomposable as well. More generally, 
if Z%i and Z%® are both solutions of an equation system involving the operations {+,*, o,©} and 
basic cycle-index sums, then Zc,o is also a solution of such an equation system. 
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Proof. The first line of the grammar is obtained as usual by distinguishing whether the marked 
cycle has length 1 or greater than 1. The second line easily follows from the block decomposition, 
as shown for instance in |14) . To wit, the marked vertex of a graph in 9' is incident to a collection 
of blocks, and a connected graph is possibly attached at each non-marked vertex of these blocks. 
Let us prove the third line in a similar way as for free trees (Claim ITU)). Consider a graph G in 9', 
and let T be the Bv-tree of G. Clearly the Bv-tree of a graph has less structure than the graph 
itself, so any automorphism of G induces an automorphism on T. In particular T is a symmetric 
cycle-pointed tree, hence it has a center of symmetry that either corresponds to a block or to a 
vertex of G. The species of graphs in 9® whose center of symmetry in the Bv-tree is a vertex (a 
block) is denoted (9®)v ((9®)b, resp.). Let 9 V (9b) be the species of graphs in 9 with a marked 
vertex (block, resp.) that is colored blue. Then clearly 9v = X • 9' and 9b = 23 o (X • 9')- Hence, 
following the notations introduced in the grammar, 9v = $C = X ■ Set o % and 9b = 23 o "K. Note 
that the structures in (9®)v ((9®)b) are the graphs in 9v® (9b®) such that the center of symmetry 
of the associated Bv-tree is the blue vertex (block, resp.). It is easy to check, in a similar way 
as for free trees, that this property holds only for the graphs of 9v® that are in X • Set® © % 
and only for the graphs of 9b® that are in 23® © r K. Finally, the 4th line, which is necessary to 
have only species of 2-connected structures as terminal species, is obtained from the second line 
by applying the derivation rules (Proposition . □ 

Remark 4.4. Trees are exactly connected graphs where each block is an edge. In other words, 
the species CF of free trees is the species 9 of connected graphs formed from the species 23 = £ 
(the one-element species that consists of the link graph). One easily checks that, in that case, the 
grammar (|48|) for 9 is equivalent to the grammar ([34| for free trees. 

4.2.1. Cacti graphs. Cacti graphs form an important class of graphs that have several algorithmic 
applications. They consist of cycles attached together in a tree-like fashion; in other words, the 
species of cacti graphs arises from the species of 2-connected structures as 23 = £ + CP where L is 
the species of the link graph and CP is the speices of polygons with at least 3 edges (i.e., 23 is the 
species of polygons where one allows the degenerated 2-sided polygon) . 

Thanks to the grammar (|48p , the unlabeled enumeration of connected cacti graphs reduces to 
the calculation of the cycle-index sums for the species of 2-connected structures 23' and 23® (the 
cycle-index sum Z^'y is also required, but it can directly be deduced from Z-%' by differentia- 
tion). Since the 2-connected cacti graphs are polygons, the possible automorphisms are from the 
dihedral group. In addition, the presence of a marked vertex (for 23') or cycle (for 23®) restricts 
the symmetries. For instance, if a structure in 23' has a marked (unlabeled) vertex v, then the 
automorphisms have to fix v; there are only two such symmetries for each polygon, the identity 
and the unique reflection whose axis passes by v. Accordingly, we have two terms in the expres- 
sion of Z-B' below, the first one for the identity, and the second one for reflections (where one 
distinguishes whether the polygon has odd or even length). 



For 23®, all symmetries must be nontrivial and have to respect the marked cycle. These symme- 
tries are of two types: rotation and reflection, which yields the two main terms in the expression 
of Z-%% below. 



The expressions for Z-z> and Z^s can be used to enumerate unlabeled cacti graphs. We just 
have to translate (using the computation rules in Remark |2. II and Remark 1 3. 3 [I the grammar (|48l) 
- applied to the species of cacti graphs - into an equation system satisfied by the corresponding 
ordinary generating functions. 



1 si 1 si + s 2 



2 1 - si 2 1 - s 2 
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Figure 7. Left (Right, rcsp.): the quotient of a dissection under a rotation 
(reflection, rcsp.). 

Proposition 16 (enumeration of unlabeled cacti graphs). The ordinary generating function 
c{x) = ^2 n c n x n of unlabeled cacti graphs counted with respect to the number of vertices satis- 
fies 

xc'(x) = H(x)+I(x), 

K(x r )^ _ 1 H(x) lH{x)+H{x 2 ) 

2\-H{x) + 2 1 - H(x 2 ) ' 



H{x) = .Texp(^— ^i), withK(x) = 



r 

r>l 

^0(r) x r H'(x r ) x 2 H'{x 2 )(H(x) 2 + 2H(x) + l) ^ r ^ 

= L— T3^ + 2(1 -H(x 2 )) 2 +H(x)^xK(x), 

r>2 y ' \ \ ) ) r > 2 

from which one can extract the counting coefficients c n ( after firstly extracting the coefficients of 
H(x)): 

xc'(x) = l- l- x + 2- l-a; 2 +3-2-x 3 +4-4-a; 4 + 5- 9-a; 5 + 6-23-x 6 + 7-63-a; 7 + .... 

4.2.2. Outerplanar graphs. Outcrplanar graphs are graphs that can be drawn in the plane so that 
all vertices are incident to the outer face. They form a fundamental subspecies of the species 
of planar graphs, which already captures some difficulties of the species of all planar graphs; for 
example, the convergence rate of sampling procedures using the Markov Chain approach is not 
known. However, outerplanar graphs are easier to tackle with the decomposition approach. For 
enumeration, we use the well-known property that 2-connected outcrplanar graphs, except for the 
one-edge graph, have a unique hamiltonian cycle. Hence, the species 23 of 2-connected outerplanar 
graphs can be identified with the species of dissections of a polygon (allowing a degenerated 2- 
sided dissection). This time, to obtain the cycle index sums Z-z* and Z%®, we have to count not 
polygons (as for cacti graphs) but dissections of a polygon under the action of the dihedral group. 
We only sketch the method here (the principles for counting such dissections are well known, going 
back to earlier articles of Read [30], see also [3] for more detailed calculations). 

For each type of symmetry (rotation or reflection), one considers the "quotient dissection", as 
shown in Figure [3 Notice that a dissection fixed by a rotation has either a central edge (only for 
the rotation of order two) or a central face. In case of a central edge e, it turns out to be more 
convenient to "double" e, so as to always have a central face before taking the quotient. Thus, 
the quotient dissection has a marked face (the quotient of the central face) that might have degree 
one (only for rotations of order at least three) or two (only for rotations of order at least two). 
Concerning quotient dissections under a reflection, there are two special vertices v\ and V2 on the 
boundary (the intersections of the original polygon with the reflection-axis), and there might be 
some other special vertices, all of degree three, on the boundary path from v± to V2\ sec Figure [7J 

The second ingredient is to take the dual of such quotient dissections in order to obtain plane 
trees, which are easier to decompose and to count. Notice that if the rotation is the identity 
rotation, then the associated plane tree is in the species 5" of plane trees with no vertex of degree 
two. Notice also that each leaf of the tree corresponds to a vertex of the dissection; see Figure \8\ 



The calculations have been done with the help of the computer algebra system Maple. 
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Figure 8. The dual tree of a dissection. 

for an example. The generating function of 5" with respect to the number of leaves satisfies 

F(x) 2 

F(x)=x + -^-L- (49) 
1 — t \x) 

To calculate Zw and Z%®, one computes separately the contributions of rotations and reflec- 
tions to Z%i and to Z^,%. In each case, using duality, the contribution is easily expressed in terms 
of the series F(x). All calculations done, one finds: 

3b' = l(F(s 1 ) + (s 1 +s 2 )P(s 2 )), Z%® = ^J2 ( t>(r)trG(s r ) + ^t 2 (l+s 2 1 Q(s2)+2s 1 R{s2)+S(s 2 )), 
where 



r>2 



m = A m - 1 FW 



1 - F{x) ' w x 1 - 2F{x) ' 
Q(x) = ±P(x), R(x) = ^(xP(x)), S(x) = ^(x 2 P(x)). 

Similarly as for cacti graphs, the expressions for Z%> and Z s ® make it possible to enumerate 
unlabeled connected outerplanar graphs. Translating the grammar (|48[) into an equation system 
on the corresponding generating functions, we obtain the following. 

Proposition 17 (Enumeration of unlabeled connected outerplanar graphs). The ordinary gener- 
ating function o(x) = J2 n °nX n of unlabeled connected outerplanar graphs counted with respect to 
the number of vertices satisfies the system: 

xo'(x) = H(x)+I(x), 

H(x) = a;exp(^^^), with K(x) = l -F{H (x)) + l -(H(x) + H(x 2 )) ■ P(H(x 2 )), 

r>l 

I(x) = lj2<t>(r)x r H>(x r )G(H(x r ))+H(x)J2x r K'(x r ) 

" r>2 r>2 

+ ^x 2 H'{x 2 )(l + H(x) 2 Q(H(x 2 )) + 2H(x)R(H(x 2 )) + S{H{x 2 ))), 

where the series F, G, P, Q, R, S are defined above. One extracts from this system (extracting 
firstly the coefficients in F, Q, R, S, then in H and K, then in I) the counting coefficients o n : 

xo'(x) = 1 • 1 • x + 2 ■ 1 • x 2 + 3 • 2 • x 3 + 4 • 5 • x A + 5 • 13 • x 5 + 6 • 46 • x 6 + 7 ■ 172 • x 7 + . . . . 

4.3. Maps. A map is a planar graph embedded on a sphere up to isotopic deformation, i.e., 
it is a planar graph together with a cyclic order of the neighbors around each vertex. There 
is a huge literature on maps since the pioneering work of Tutte [33] . As we show next, the 
decomposition grammar (|48|) for maps is actually simpler than for graphs, and it allows us to 
enumerate (unrooted unlabeled) 2-connected maps in terms of not necessarily connected maps. 
To write down the grammar, it turns out to be more convenient to take half-edges as atoms instead 
of vertices. Denote by M the species of maps - so := Z ■ M' is the species of rooted maps (maps 
with a marked half-edge) - and by 25 the species of 2-connected maps (the loop-map is considered 
as 2-connected). 
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Proposition 18. The species of rooted maps and symmetric cycle-pointed maps (in each length 
£>2 of the marked cycle) have the following recursive specification over the corresponding species 
of rooted and cycle-pointed 2-connected maps. 

j ft = DC • Seq o DC, % = X-'B'o'K 1 % = X •{! + %), 
| M° (l) = Bfo © JC + Cyc° w © X for t > 2 . 

Proof. The arguments are similar to the proof for graphs ( Proposition 1 15|) . The only difference is 
that one takes the embedding into account, hence corners (which are in one-to-one correspondence 
with half-edges for a given map) play for maps a similar role as vertices do for graphs, and 
a Set construction typically becomes a Cyc construction here. Let us comment here on the 
decomposition for symmetric cycle-pointed maps (the one for rooted maps is well known, see |33j ) . 
One has 

M® = B® © IK + Cyc® © DC, (51) 

where the first (second) term takes account of the maps whose center of symmetry - for the 
associated block-decomposition tree - is a block (vertex, respectively). Further simplification is 
possible, since a rooted map has only the identity as automorphism. Hence, the species of rooted 
maps "K and % satisfy CK° = and X° = 0C%y Thus, Equation (|5"Tj) can be "sliced" into a 
collection of equations, one for each length £ > 2 of the marked cycle. □ 

An important property of any map automorphism - as shown by Liskovets [22) - is that all 
its cycles have the same length t, which is also the order of the automorphism. Hence, the 
number of half-edges of a cycle-pointed map with a marked cycle of length £ is divisible by £. 
For £ > 1, denote by M^(x) (B^(y)) the series counting unlabeled cycle-pointed maps (2- 
connected maps, respectively), according to the number of half-edges, divided by £. In particular, 
R(x) := M(!)(x) and S(x) := B^{x) are the series counting rooted maps and rooted 2-connccted 
maps, respectively. We clearly have 

Given this simplification, the grammar (|50p is translated into the following system relating the 
series counting species of maps and species of 2-connected maps: 

R & = 1 K[ K< V H(x)=x(l + R(x)), K(x) = -^S(H(x)), (52) 
1 — A (x) H (X) 

M W (x) = ^M Bw (H(x)) + m **'^ x) iov£>2. (53) 

In the case of maps, the decomposition grammar is used in the other direction, i.e., one obtains 
the enumeration of (unrooted) 2-connected maps from maps. Indeed, unconstrained maps are 
easier to count, by a method of quotient [22| similar to the one we have used for counting dissections 
in Section HX1 

Let us first review (from Tutte |33j ) how one obtains an expression for the series S(y) counting 
rooted 2-connectcd maps from an expression for the series R(x) counting rooted maps. One starts 
from the following expression of R(x): 

R(x) = ^ 2 - 9 fJ , with = p(x) specified by p = x 2 + 3/3 2 . (54) 

(i - spy 

Next, notice that the change of variable y = H(x) = x(l + R(x)) between rooted maps and rooted 
2-connected maps is such that y 2 is also rational in /3: 

Equivalently: 

P 



y = 77(1 — 77) , where 77 := 



1-3,3' 
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so the dependence between 77 and (3 is invertible: f3 = 77/(1 + 3?y). Notice also from (|52j) that 

= ^-K(x) = (1 + = R(x) = ^J*® - 

Replacing j3 by 77/(1 + 377), one gets 

V 2 

S(y) = 77(2 - 3?7), with 77 = 77(7/) specified by 77 = — , 

(1 - rj) 2 

which can also be written as 

S(y) = s (y 2 )> witn S(y) = ?7(2 - 377), and 77 := 77(77) specified by rj = _ _, 2 . (55) 

In a similar way, if two series fix) = g(y) are related by the change of variables 77 = H (x) and 
if f(x) is rational in /3(x), then g(y) is rational in 77(7/) (replacing /3 by 77/(1 + 37/)). For instance, 
for I > 3, it has been shown by Liskovets using the quotient method (see |12j for the reformulation 
on series) that 

Since H{x)/x = 1 + i?(x) and i£"(cc) = i£(ai)/(l + R{x)) are rational in /3, as well as xH'(x) and 
xK'[x) (noticing that xAf/dx = 2x 2 df/dx 2 = 2/3(1-3/3) • (d//d/3)/(dz 2 /d/3)), one finds from (53) 
a rational expression in (3 for the series Bu\(H(x)). Replacing (3 by 77/ (1 + 377) in that expression, 
one finally gets: 

which can also be written as 

1 - 3rj(y) ' 

In a similar way, starting from the expression (given in |12j ) 

1 - 2f3{x) 2 

(2){X) ~ + (1 - 6/3(x))(l - 3/3(z)) + X ' (1 - 6p(x))(l - 3/3(.t)) ' 
one obtains the following expression for B(o,\(y): 

B{2){V) = 1 - 377(77) +y I-3^)' 

which can also be written as 

B( 2) (y) - P(y 2 ) + yQ(y 2 ), with p(„) = v{v ^~^f and Q( y ) = ^r^y ( 57 ) 

Proposition 19 (counting unrooted 2-connected maps, recover [23]). The number t n of (unrooted 
unlabeled) 2-connected maps with n edges satisfies: 

tn = 2^ ( Sn + Un + 2 ^ 0(7i/fc)-(9fc 2 -9fc + l)s fe ), (58) 

2(3n-3)! 77(77+1) . (377 - 4)n . 
wftere s„ = n ^ 2rt _ [v > M ™ = 2 s ("+ 1 )/ 2 ^ n ls odd ' and Un = § s ™/ 2 l * n ls even - 

Proof. Cycle-pointing ensures that the generating function 25° (y) = ^2 n 2nt n y 2n satisfies 
%°(y) = S(y) + B [2) {y 2 ) + £ B (£) (i/) = S(y) + (B [2) {y 2 ) - G(y 4 )) + £ <fi{l)G{y 21 ). 



B (t) (y) = cj>(£)G(y 2 ) for I > 3, with G(y) = , ZL - ( 56 ) 



e>3 e>2 
Extracting the coefficient [y 2n ] in this equation yields 

4>{n/k)v k , (59) 

£>2,l\n k\n,k^n 
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where s n = [y n }S(y), «„ = [y n }G(y), u n = [y n ](B [2) {y) - G_(y 2 )), which is [y n ' 2 ]P{y) - v n/2 if n is 
even and y lyU ~ 1 ^ 2 Q{y) if n is odd. Notice that the series S(y), G(y), P{y), and Q(y) are rational 
in the simple series rj(y) = y/(l — rj{y)) 2 ■ Hence the Lagrange inversion formula [H Section 3.1] 
allows us to extract exact formulas for the coefficients s n , v n , and u n . Substituting these exact 
expressions in (|59[) . one obtains the announced formula for t n . □ 

The enumeration of unrooted 2-connected maps has first been done by Liskovcts and Walsh [23) 
using the quotient method in a quite involved way. More recently, the counting formula has been 
recovered in |12j using a method of extraction at a center of symmetry on quadr angulations. What 
we do here is equivalent to [12] , but the cycle-pointed framework allows us to write the equations 
on generating functions in a more systematic way. 

4.4. Asymptotic enumeration. Cycle-pointing makes it possible to easily obtain an asymptotic 
estimate for the coefficients counting the number of unlabeled structures from a species, provided 
that the singular behavior of the OGS counting the associated species of rooted unlabeled structures 
is known. 

We illustrate the method on free trees. Let R(x) be the OGS of rooted unlabeled nonplane trees, 
which is specified by R(x) — £cxp(^ i>:l R(x l )/i). It is well known that R(x) has a dominant 
singularity p < 1 of the square-root type [TUJ VII. 5]. That is, in the slit complex neighborhood 
D e := {x | x — p (fc R+ and \x — p\ < e} we have the expansion 



R(x) = 1 -aX + o(X), where X = y/l-x/p, (60) 
which yields - using transfer theorems of analytic combinatorics |1Q[ VI] - the asymptotic estimate 

R n ~ cn- 3/2 p~ n , where c = « 0.43922, p w 0.33832, (61) 

2v% 

for the number of rooted unlabeled nonplane trees with n vertices. 

To obtain a similar estimate for free trees, we consider the OGS P{x) of cycle-pointed nonplane 
trees and start from the expression of P{x) obtained in Proposition [Til 

P(x) = x 2 R'(x 2 ) + (1 + x e R'(x e ))R(x). 

e>2 

Notice that, since p < 1, the series A := x 2 R'(x 2 ) and B := 1 + X^>2 x R'(ar) are analytic at 
x = p, and the value at x = p of B is the positive constant 

b:=l + J2p e R'(p e )- (62) 

l>2 

Therefore, from the singular expansion (j60[) of R{x), wc obtain 

P(x) = P(p) -abX + o(X) . (63) 

Let us simplify further the positive constant b = B(p). First, by deriving the equation that 
specifies R(x), one obtains xR'(x) = R(x)(l + J2i>i x t R'(x i )), which yields 

B(x)R(x) = xR'(x)(l - R(x)) . 

By deriving the singular expansion of i?(x), one obtains 

R'(x) = -^-X- 1 +o(X- 1 ), 
2p 

hence xR'(x)(l — R{x)) converges to a 2 /2 as x — > p, i.e., b = a 2 /2. 

Proposition 20 (asymptotic enumeration of free trees). The number F n of unlabeled free trees 
with n vertices satisfies 

F n ~ (2irc 3 )n- 5 / 2 p- n 7 (64) 

where c is the constant and p^ 1 is the growth ratio in the estimate ([61[1 for rooted nonplane trees 
(R n ~cn- 3 / 2 p- n ). 
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Proof. From the singular expansion (|63[) of P(x) we obtain (again by the transfer theorems of 
singularity analysis) 

z«[P(z)] ~ ^n-^p- . 

Using c = a/(2y/Tr) and b = a 2 /2, we have ab/(2y / n) = 2irc 3 . Finally, Theorem \b\ (unbiased 
pointing) yields F n = ±[x n ]P(x), so F n ~ (27rc 3 )n~ 5 / 2 p~ n . □ 

It is also possible to get the estimate of F n from Otter's dissimilarity equation (or from the 
dissymmetry theorem). However, we find that cycle-pointing provides a more transparent expla- 
nation why the asymptotic estimate of the coefficients F n counting unlabeled structures from an 
unrooted "tree-like" species 3" is of the universal type cn~ 5 / 2 p~ n . The argument is very simple: 

• The OGS P(x) of the cycle-pointed species 3^° is positively expressed in terms of the 
OGS of the rooted species, which has a square-root dominant singularity. Therefore, P(x) 
inherits the same singularity and singularity type (square- root). 

• Transfer theorems of singularity analysis ensure that a square-root dominant singularity 
yields an asymptotic estimate in cn~ 3 / 2 p~" for the coefficients [x n ]P(x). Since F n = 
^[x n ]P(x), one gets F n ~ cn~ 5 / 2 p~ n . 

This strategy applies to all species of trees encountered in this section, as well as to cacti graphs 
and connected outerplanar graphs (in all cases one starts from the singular expansion of the OGS 
counting the corresponding rooted species). 



5. Application to Random Generation 

Recently, so-called Boltzmann samplers have been introduced by Duchon et al [9] as a general 
method to efficiently (typically in linear time) generate uniformly at random combinatorial struc- 
tures that admit a decomposition. In contrast to the more costly recursive method of sampling |25j , 
which is based on counting coefficients of the recursive decomposition, Boltzmann samplers are 
primarily based on generating functions. Until now Boltzmann samplers were developed in the 
labeled setting [9] and partially in the unlabeled setting [13] . 

In this section we provide a more complete method in the unlabeled setting. In order to deal 
with the substitution construction and the cycle-pointing operator (which are not covered in [T3"] ). 
we have to describe samplers not solely based on ordinary generating functions, but on cycle index 
sums - also known as Polya operators. Therefore we call these random generators Poly a- Boltzmann 
samplers. 

With these refined samplers we are able to design in a systematic way (via specific genera- 
tion rules) a Polya-Boltzmann sampler for species that admit a recursive decomposition, thereby 
allowing in the decomposition all operators that have been described in this article. When spe- 
cialized suitably, a Polya-Boltzmann sampler reduces to an ordinary Boltzmann sampler, hence it 
provides a uniform random sampler for species of unlabeled structures. In particular, we obtain 
highly efficient random generators for the species in Section |4j for trees, cacti graphs, outerplanar 
graphs, etc. 

5.1. Ordinary Boltzmann Samplers. Let A be a species of structures, and let A(x) be the 
ordinary generating series for A. A real number x > is said to be admissible iff the sum defining 
A{x) converges {x within the disk of convergence of the series). Given a fixed admissible value 
x > 0, an ordinary Boltzmann sampler for unlabeled structures from A is a random generator 
TA(x) that draws each structure 7 6 .A with probability 

p x ( 7 ) = . (65) 

A{x) 

Notice that this distribution has the fundamental property to be uniform, i.e., any two unlabeled 
structures of the species with the same size have the same probability. 
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5.1.1. Automatic rules to design Boltzmann samplers. As described in j9j, there are simple rules 
to assemble Boltzmann samplers for the two classical constructions Sum and Product (Bcrn(p) 
stands for a Bernoulli law, returning "true" with probability p and "false" with probability 1— p). 

e=A + T>. re(x): if Bem(A(x) /Q(x)) return TA{x) else return T%(x) 
e = A • TG{x): return (TA(x),T'B(x)) {independent calls} 

These rules can be used recursively. For instance, the species T of rooted binary trees satisfies 

7 = X + 7 ■ T, (66) 
which translates to the following Boltzmann sampler: 

rT(a;): if Bern f^f^J return leaf else return (T7(x), node, T7(x)). 

5.1.2. The complexity model. Typically, when J has a recursive specification over the species 
A\, . . . ,Ai, then our sampling procedure TS^x) will require that we can evaluate the ordinary 
generating functions for A\, . . . ,Ai at real values x. Indeed, for a species 3 defined as A\ + A-2, 
each Bernoulli choice requires to draw a uniform value in [0, 1] and compare it with a ratio of 
the form Ai(x)/3 r (x). In the following we work with the complexity model where we assume that 
there exists an oracle that provides at unit cost the exact values of these generating functions at 
x, and that a random number in [0, 1] can be generated and compared with a fixed value such as 
A(x)/G(x) in constant time as well. We will refer to this complexity model as the real- arithmetic 
complexity model in the following. 

The model is justified since in many applications we obtain expressions for the ordinary gen- 
crating scries that allow a rapid numeric evaluation of those series at given values, for example 
with the Newton method. Then, in practice, one works at a fixed precision, say TV bits (typically 
TV = 64, correspondingly roughly to 20 decimal digits). 

Let us mention that the Boltzmann samplers for the constructions Multiset and Cycle — as 
given in |13j and recovered in a more general framework here — require typically the values of 
the generating functions not only at x, but at all powers x l . Since combinatorial species often 
have exponential growth rate (which is the case for all examples presented here), the dominant 
singularity p satisfies p < 1, hence x < p < 1. Therefore the values A(x l ) decrease exponentially 
fast with i. When working at fixed precision of N bits, one can thus discard the powers greater than 
k = iV/log 2 (l/p) and assume that the oracle provides the evaluations of the generating functions 
at X j X j . . . . X . For a more detailed study and implementation of the evaluation procedures we 
refer to the recent article by Pivoteau, Salvy, and Soria [27] . 

Proposition 21 (Duchon et al. [5]). Let 'J be a species that can be decomposed recursively from 
{1, X} in terms of the constructions {+, •} (this is meant analogous to but more restricted than 
Definition \2.7\ ). Then one can obtain in a systematic way (from the recursive specification) an 
ordinary Boltzmann sampler TJ(x) for 5 ' . In addition, in the real- arithmetic complexity model, 
TJ(x) operates in linear time in the size of the output. 

This result was recently extended in [T3] to other constructions, such as the Multiset and 
Cycle constructions (and their counterpart with fixed number of components). In this section 
we extend this result to the substitution construction, and to the cycle-pointed constructions. It 
follows that any species that is cycle-pointed decomposable over species where we already have a 
Polya-Boltzman sampler also has a Polya-Boltzman sampler. 

Remark 5.1. Note that in the general results on sampling we consider species up to isomorphism. 
Theoretically, this is a necessary assumption, since isomorphisms between species might in artificial 
examples be non-effective. However, in all the presented examples and applications in this article, 
the isomorphisms between species are straightforward and efficiently computable. The effectiveness 
of isomorphisms between the output species of the sampling procedures and the actual species are 
actually more related to the question how combinatorial structures are represented on a computer, 
and in particular they do not concern the complexity of the sampling task itself. 
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5.1.3. Targeting Boltzmann samplers. Boltzmann samplers often lead to very efficient exact-size 
and approximate- size random samplers. In order to draw unlabeled structures uniformly at random 
from a species A at (in case of exact-size sampling) or around (in case of approximate-size sampling) 
a target-size n, one simply repeats calling the Boltzmann sampler TA(x) - with a suitably chosen 
value of x - until the size of the output is n (exact-size sampling) or is in [n(l — e),n(l + e)] 
(approximate-size sampling), where e is a tolerance-parameter fixed by the user. It turns out that 
for a wide class of species, which covers the species encountered in Section [3] (trees, cacti graphs, 
outerplanar graphs), this method works very well, as proved in [3]. 

Proposition 22 (Duchon et al [3]). Let J be a species such that asymptotically 

|J„|~cn- 3 / 2 p~ n 

for some positive constant c, and where p is the radius of convergence of ^(x), assuming that 'J(p) 
is convergent Also suppose that there is a Boltzmann sampler T3(p) at x = p such that the cost 
of generating a structure is linearly bounded by the size of the structure all along the generation 
process. ThenT3 : {p) yields an exact-size (approximate- size, resp.) sampler for unlabeled structures 
from 5" with expected complexity 0(n 2 ) (0{n/e), resp.), where n is a target-size and e is a tolerance- 
ratio. 

The exact-size and approximate-size samplers are obtained by running T3 r (p) until the size of 
the output is in the target domain Q n (that is, Q n = {n} for exact-size sampling, and f2„ = 
[n(l — e), n(l + e)] for approximate-size sampling). To obtain the stated complexity, it is necessary 
that the generation of too large structures is aborted as soon as the size of the generated object 
gets larger than Max(f2 n ). 

5.2. Polya-Boltzmann Samplers for classical species. Let A be a species. Recall that a 
symmetry on a species A is a pair (A, a) where A € A and a is an automorphism of A. A 
symmetry has a weight-monomial W(A.a), as defined in (|3]); and the cycle index sum Zj\[s\, S2, ■ ■ •) 
is the sum of the weight-monomials over all the symmetries on A. Similarly as for the one- 
variable case, a vector (si)i>i of nonnegative real values is said to be admissible if the sum 
defining Zj\,{sy, S2, ■ ■ •) converges. Given an admissible vector (sj)j>i, a Polya-Boltzmann sampler 
is a procedure TZji(si, S2, ■ ■ • ) that randomly samples symmetries on A such that each symmetry 
(A, a) is drawn with probability 

P (3l , S2 ,...)(A,a)= (67) 
Z A (si, s 2 , ■ ■ ■) 

where the weight-monomial W(A,a) is evaluated at (s\, S2, ■ ■ ■)■ This probability distribution is 
called the Polya-Boltzmann distribution for A at (si)i>i. The following simple lemma ensures 
that Polya-Boltzmann samplers are a refinement of ordinary Boltzmann samplers, in the same 
way as cycle index sums are a refinement of ordinary generating functions. 

Lemma 23 (Polya-Boltzmann samplers extend ordinary Boltzmann samplers) . Consider a species 
A having a Polya-Boltzmann sampler TZji(si)i>i. Then, for any value x admissible for A(x), the 
sampler TZj±(x,x 2 ,x 3 , . . .) is an ordinary Boltzmann sampler for unlabeled structures from A at 
x. 

Proof. The generator TZji(x, x 2 , . . .) gives weight x n j(n\Zj\{x, x 2 , . . .)) to each symmetry of size 
?i. Since Zji(x, x 2 , . . .) = A{x) by Lcmma[TJ this weight simplifies to x n /(n\A(x)). In addition, 
we have seen in Lemma [T] that each unlabeled structure 7 € A n gives rise to n\ symmetries. 
Hence, each unlabeled structure of size n has weight x n /A(x) when calling TZji(x, x 2 , . . .), i.e., 
TZji(x, x 2 , . . .) is an ordinary Boltzmann sampler for unlabeled structures from A. □ 

In the next two sections, we describe Polya-Boltzmann samplers for unlabeled structures from 
basic species and for the constructions {+, •, o}. Note that the output of a Polya-Boltzmann sam- 
pler for a species A consists of a species from A[n] together with an automorphism on that struc- 
ture. In all the random generators to be described (as well as in the procedures for cycle-pointed 



^The asymptotic behaviour cp n n 3 / 2 is called universal [T], as it is widely encountered in combinatorics. 
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species), the resulting structure S is made well- labeled by applying a procedure DistributeLa- 
bels that substitutes [1, . . . , for the atoms of S uniformly at random (i.e., according to a 
permutation of size \S\ taken uniformly at random). 

We also assume to have generators for classical distributions: 

• Geom(p) returns an integer under the geometric law of parameter p £ [0,1]: Pr(fc) = 
p k -\l-p); 

• Pois(A) returns an integer under the Poisson law of parameter A: Pr(fc) = e~ x X k /kl; and 

• Loga(A) returns an integer under the distribution Pr(fc) = (log(l/(l — \)))~ 1 A k /k; this 
distribution we call Logo, law of parameter A. 

The generators for those distributions (more generally, any generator for an explicit distribution 
on integers) can be easily obtained from the "inversion method" [5J §2.1] and [5TJ §4.1]). 

5.2.1. Polya-Boltzmann samplers for basic species. At first, let us describe Polya-Boltzmann sam- 
plers for the basic species Seq, Set, Cyc, and their counterparts SeqJ^, Set^, Cyc^. In each 
case, the design of the sampler is guided by the expressions of the cycle index sums for the basic 
species as given in Figure [TJ 

Proposition 24. The random generators shown in Figure [P| are Polya-Boltzmann samplers for 
the corresponding species. 

Proof. For Seq, the proof is easy. Since Zs ET = J2k>i s i> tne probability of a sequence to have 
size k must be Si/Zs ET , i.e., the size distribution is a geometric law of parameter s\. 

For Set, observe that the sum of weight- monomials over all symmetries of type (ni, ri2, ■ ■ ■ , Uk) 
in Z Set is 

a? 1 (s 2 /2)" 2 (s k /k) n - 
ni! n 2 l n k l 

Therefore, a Polya-Boltzmann sampler has to draw a collection of cycles such that the number rii 
of cycles of length i follows a Poisson law of parameter si/i for i > 1, and the m's arc independent. 
This is precisely what the algorithm rZg ET in Figure |H] does, upon choosing a priori the size of the 
largest cycle to be drawn. 

For Cyc, the argument is similar. As we have seen in Section T2.6.1[ the sum of the weight- 
monomials over all the symmetries of order r is 



1 - S r 

(r) 

Therefore the order of the automorphism has to be chosen with probability Zq^ c /Zcyc- ln addition, 
for each fixed order r, the probability of the cycle being r x k has to be s*/k/ log(l/(l — s r )), i.e., 
the size (divided by r) has to follow a Loga law of parameter s r . Finally, for all automorphisms 
of size r and size r x k, all possible 'rotation angles' (there are 4>{r) possibilities) have to be 
equiprobable. This is exactly what the generator YZqyc does. 

The proof that the generators for the species SeqJ^, Set'^, and Cyc'^ are Polya-Boltzmann 
samplers follows similar arguments upon restricting to structures of size k. □ 

5.2.2. Polya-Boltzmann samplers for combinatorial constructions. As shown in Figure [TUl Polya- 
Boltzmann samplers make it possible to have a simple sampling rule for each of the standard 
constructions; that is, sampling rules not only for sum and product, but also for the substitution 
construction. 

Proposition 25. Let 6 = A A 25, with A € {+, ",0}. When there are Polya-Boltzmann samplers 
for A and 23, then there is also a Polya-Boltzmann sampler TZq(si, S2, . . .) for C that can be 
constructed from the samples for A and 25, as given in Figure [751 

Proof. For 6 = A + 25, the proof is easy. Note that Sym(.A + 25) = Sym(A) + Sym(25). As shown 
in [5] (for the standard weight x n /n\), a disjoint union yields a Bernoulli switch on Boltzmann 
samplers, with probability corresponding to the ratio of the series for A divided by the series for 6 
(this argument works as well here, where we take the refined weight s™ 1 s 1 ^ 2 . . . s^ k /nl). Therefore 
the probability of the Bernoulli switch has to be Za/Zq. 



32 



MANUEL BODIRSKY, ERIC FUSY, MIHYUN KANG, AND STEFAN VIGERSKE 



(1) Sequence. 

Algorithm TZseq(si, s 2 , ■ ■ • ), with si < 1: 
k Geom(si); 

return a sequence of k atoms (endowed with the identity-automorphism) 



(2) Set. 

Define the probability distribution relative to (s;);>i: 



Pr(K < k) = - — y-^- r TT exp ( ± 



Let Max_Index(si, S2, . . .) be a generator for this distribution. 

Algorithm FZset(si, s 2 , . . .) : 

J Max_Index(si, s 2 , ■ ■ •); 

for j from 1 to J — 1 do kj <— Pois ^-j- j end for 

kj Pois>i (^r-) {Poisson conditioned to output a strictly positive integer} 
return a collection of cycles of atoms where there are kj cycles in each length j > 0. 



(3) Cycle. 

Given (si)>i such that Zcyc(si, S2, ■ ■ ■) converges, consider the probability distribution 

Pr(7? = r) = r -t ^ log (1/(1 - s P )) for r > 1. 

Let ReplicOrder(si, S2, . . .) be a generator of this distribution. 

Algorithm FZcyc(si, S2, ■ ■ ■) 

r i — ReplicOrder(si, S2, . . •); 
j < — Loga(s r ); 

Draw an integer b £ [l..r — 1] that is relatively prime to r uniformly at random ; 
return the cycle of length j x r endowed with the automorphism: 

"each atom is mapped to the atom that is j X b units further on the cycle" . 



(1') Sequence of size k. 

Algorithm rZ SEQ[k] (si, s 2 , . . .) : 

return a sequence of k atoms (endowed with the identity-automorphism) 



(2') Set of size k. 

Algorithm rZ SEj[k] (si, s 2 , ■ ■ .) : 

s "l s ' l 2 g n k . [ s «4 s nk ]Z [fcl 

Draw a partition sequence it of order k such that Pr(-7r) = — - — - — - — - — - — — — — — 

2g ET w(si,S2, ■ ■ ■) 

return a collection of ni cycles of length 1, n-z cycles of length 2, . . ., n k cycles of length k. 



(3') Cycle of size k. 

Algorithm FZ CYc[k] (s 1 , s 2 , ■ ■ ■) : 

4>(r)Sr /r 



Draw a divisor r of k with distribution Pr(r) 



^CyoW( Si ' S2 '--0 
Draw an integer b £ [l..r — 1] that is relatively to r uniformly at random; 

return the cycle of length k endowed with the automorphism: 

"each atom is mapped to the atom that is kb/r units further on the cycle" 



Figure 9. Polya-Boltzmann samplers for basic species. In all these random 
samplers, the output structure is made well-labeled using the procedure Dis- 
tributeLabels. 
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Sum: 6 = A + "B. 




Given (sj)j>i such that Z a := Za{s\, s 2 , ■ ■ •) and Z(, := Z<z{si, s 2 , ■ • ■) converge: 


AlfforitViTn P^/of.si so 1 * 




it riern^Z a /(Z a + Zb) j then return 1 Za[si, s 2l . . .) 




else return r^jj(si, S2, .. .) end if 




Product: C = A • 23. 




Given (s^)^ i such thcit Zj± (si , #2 5 ■ • ■ ) £md (si j -52? • ■ •) converge! 




Algorithm rz" e (si,s 2 , ■ • ■) : 




return (TZa(si, s 2 , ■ ■ ■), TZ%(si, s 2 , . . .)) {independent calls} 




Substitution: 6 = A o 23. 




Given (si)i>i such that := Z%(si, s 2 i, S31, • ■ •) converges for each i > 1, 




and Zji(bi, 62, &3, ■ • ■) converges: 




Algorithm TZe(si,s 2 , ■ ■ .) : 




Compute (A,cta) <- rz"yt(&i, 6 2 , h, ■ ■ ■)', 




for each cycle C = (iti, . . . , iik) of er^ («i has smallest label in 


C) do 


Compute (B, a B ) <- TZ^(s k , s 2 k ■ ■ ■); 




Replace each atom of C by a copy of B; 




for each cycle D of <jb do 




Let Ed be the cycle composed of the copies of D at u\ , . 




end for 




end for 




return the resulting structure and the automorphism consisting of 




the composed cycles Ed 





Figure 10. The rules to specify a Polya-Boltzmann sampler for a species that 
has a recursive specification. In all these random samplers, the finally returned 
structure is made well-labeled using the procedure DistributeLabels. 



For product, 6 = A-H, Sym(C) is like a partitional product of Sym(yi) and Sym(23). Therefore, 
a Boltzmann sampler classically consists of two independent calls to Boltzmann samplers, as shown 
in [5] (again, for the standard weight x n /n\). All the arguments work the same way for the refined 
weight s™ 1 ,?™ 2 • ■ • s k k Therefore, one has to call independently a Polya-Boltzmann sampler for 
A and a Polya-Boltzmann sampler for 23. 

For substitution, 6 = A a 23, recall (from Section [2. 6.21 Equation (|13[0 that for each partition 
sequence n, the sum of the weight-monomials over all the symmetries on C of type ir = (n\, . . . , n n ) 
satisfies the expression 

z (n) = ^ 6 «i & «2 _ _ _ fe »„ ) where ^ = n , [s m . . . s ^]Z A ( Sl , s 2 , . . .) and b t = Z^( Si ,s 2i , . . .). 

Hence, a Polya-Boltzmann sampler for C must draw the core structure following the Polya- 
Boltzmann distribution for A with parameters (pi,b 2 ,...). In addition, as discussed in Sec- 
tion [2T621 once the type tt of the core symmetry is fixed, the structures substituted at the cycles 
of the core-automorphism form a partitional product of the form 

Sym(23) ri1 • {Sym(23) duplicated}™ 2 •(••■)• {Sym(23) replicated k times}"". 

Recall that a partitional product yields independent Boltzmann samplers. Hence, once the core- 
automorphism (A, a a) is drawn, the symmetries in 23 that are substituted at each cycle of a a must 
be independent calls of a Polya-Boltzmann sampler for 23, and the parameters of the sampler must 
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be (sj, S2i, ■ ■ ■) if the replication order is i (i is also the length of the cycle of the core structure 
where the substitution occurs). This is precisely what the generator TZq does. □ 

5.2.3. Polya-Boltzmann samplers for decomposable species. The random generation rules shown 
in Figure for basic species and Figure [TU] for general constructions can be combined to design a 
Polya-Boltzmann sampler for species that are decomposable over those basic species. 

Definition 5.1. A Polya-Boltzmann sampler TZji(si, S2, ■ • •) is called linear if, in the real- 
arithmetic complexity model described above, the number of computation steps for generating 
a structure from A is linearly bounded by the size of the structures all along the generation 
process. 

Theorem 26. Let A be a species that is decomposable over species Ai, . . . ,Ai (Definition \2. 7| |, 
each having a linear Polya-Boltzmann sampler. Then also the species A has a linear Polya- 
Boltzmann sampler TZji(s\, S2, ■ ■ ■)• 

Proof. Given Proposition 1251 the Polya-Boltzmann sampler for A is straightforward by 'unrolling' 
the recursive specifications that show that A is decomposable; so we are left with the task to show 
that we also have a linear Polya-Boltzmann sampler for A. 

Our argument is based on the following concept. The decomposition tree for structures A from 
A is defined as follows. The leaves of this tree are structures from the species A%, . . . ,Ai. The 
other vertices of this tree are structures from the species %f } that appear in the definition of 
recursive specifications (Definition 12. 6|) . Suppose that A is specified by variable x s in the recursive 
specification, and that n is the size of A. The root of the decomposition tree is A itself, which is 
a structure from Xi™^ . To define the children of a vertex A' from X^ in the decomposition tree, 

consider the species X^ ^ for which Xk appears in the expression ej in the recursive specification. 
The children are the structures of those species from which A has been constructed. The size of 
a decomposition tree is the number of its internal (i.e., non-leaf) vertices. 

Let A be a structure from A, and let Ai, . . . ,A t be the leafs of the decomposition tree for A. 
Note that the cost of the sampling procedure for producing A is linearly bounded by the size of the 
decomposition tree for A plus the total cost for sampling Ai, . . . ,A t . The size of A is distributed 
over all structures Ai, . . . , A t from terminal classes, consequently \A\ = \Ai \ + • • • + \ A t \. Since the 
samplers for the terminal classes are linear, we conclude that the total cost for sampling A\, . . . , A t 
is linear in |j4|. 

We finally show that the size of the decomposition tree for A is linear in the size n of A. 
More specifically, when m is the number of variables in the recursive specification of A, then the 
size of the decomposition tree is bounded by 2mn. Otherwise, there would be a branch in the 
decomposition tree that contains two vertices B\ from X^ and B-2 from xj™ 2 ' (corresponding to 
the same variable Xj) such that B\ and Bi have the same size n. But then, by looping through 
the corresponding cycle in the decomposition grammar, we can produce infinitely many distinct 
structures of size n from the recursive specification, contradicting the fact that Xj [n] is finite. The 
details of this argument are easy and left to the reader. □ 

Consequently, the rooted species we have encountered (i.e., with a single marked atom) in 
Section 0] can be endowed with efficient random samplers. 

Proposition 27. In the real- arithmetic complexity model (oracle assumption), the following 
species admit an exact-size sampler of expected complexity 0(n 2 ) (n being the target-size) and 
an approximate- size sampler of expected complexity 0(n/e) (e being the tolerance-ratio) for the 
uniform distribution on unlabeled structures: rooted nonplane (plane, resp.) trees, rooted non- 
plane (plane, resp.) trees whose node degrees are constrained to lie in a finite integer set Q, rooted 
cacti graphs, rooted connected outerplanar graphs. 

Proof. All the rooted tree species stated here are decomposable over the discussed basic species, 
as we have seen in Section 01 (For instance, the species 5F' of rooted nonplane trees satisfies 
J 1 = Set o (X ■ J 7 ).) Hence, by Theorem 121)1 the species of rooted trees can be endowed with a 
linear Polya-Boltzmann sampler. A species S' of rooted connected graphs - provided that it is 
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closed under taking 2-connected components - is decomposable in terms of the subspecies 23' of 
rooted 2-connectcd graphs: 

s' = SETo(s'o(x-g')). 

As we have explained in Sections 14.2.11 and 14.2.21 there is a decomposition strategy for rooted 
2-connected cacti graphs (rooted polygons) and rooted 2-connected outerplanar graphs (rooted 
dissections of a polygon). The linear Polya-Boltzmann sampler TZ^i yields in turn a linear Polya- 
Boltzmann sampler for 3' (using the specification of 3' in terms of 23' stated above). Hence, each 
of the rooted species stated above has a linear Polya-Boltzmann sampler, which becomes a linear 
ordinary Boltzmann sampler when specialized to Sj = x l . 

Moreover, all these rooted species obey the universal asymptotic form c p~ n n~ 3 / 2 , as shown 
in [55] for trees, in [3T] for cacti graphs, and in [3] for outerplanar graphs. Hence, by Proposition!^ 
a Boltzmann sampler run at the dominant singularity yields an exact-size (approximate-size, re- 
spectively) sampler with expected complexity 0(n 2 ) (0(n/e), respectively). □ 

5.3. Polya-Boltzmann samplers for cycle-pointed species. 

5.3.1. Definition. Given a cycle-pointed species ft, a vector (sj,i,),->i of nonncgative real values 
is said to be admissible if the sum of weight-monomials defining Zy converges when evaluated at 
this vector. Given a fixed admissible vector (sj,ii)j>i, a Polya-Boltzmann sampler is a procedure 
TZy(si,ti)i>i that generates a rooted c-symmetry on ft at random such that each rooted c- 
symmetry (P, a, v) of ft(ft) is drawn with probability 

F(P,a,v)= 7 P 7 } (68) 

with W(p a v j as defined in (|15[) . This probability distribution is called the Polya-Boltzmann distri- 
bution for ft at (s^ ti)i>\. Similarly as for classical species, the procedure of calling YZy(x % , x l )i>± 
(where x is admissible for the ordinary generating function ft(x)) and then returning the underly- 
ing unlabeled structure yields an ordinary Boltzmann sampler T7(x). The following sampling rules 
make it possible to systematically assemble Polya-Boltzmann samplers for cycle-pointed species. 

5.3.2. Polya-Boltzmann samplers for basic cycle-pointed species. 

Proposition 28. The random generators shown in Figure [771 are Polya-Boltzmann samplers for 
the corresponding basic cycle-pointed species. 

Proof. The arguments are very similar to the ones in the proof of Proposition 1241 Observe that 
Seq° = (X° * Seq) * Seq. The marked atom (the atom bearing the marked cycle, which has 
length 1 here) must be preceded by a sequence of fci atoms and followed by a sequence of k2 atoms 
such that k\ and &2 follow independently a geometric law of parameter s\. 

Next, we have Set°^ = 6^ * Set where 6^ is the cycle-pointed species of cycles of £ atoms 
(the cycle being marked), which explains the samplers TZ$ ET o and TZ Set ® in Figure [TT] 

A cycle-pointed structure in Cyc^ consists of £ isomorphic copies (attached cyclically in a chain) 
of an object in X° * Seq. Additionally, one needs to specify the shift of the automorphism; if the 
cycle has length n£, the possible shifts are n ■ i where i <E [l..£] is relatively prime to £, hence there 
are <j>(£) possibilities for the shift. 

The proofs for the samplers with k components follow similar arguments. □ 

5.3.3. Polya-Boltzmann samplers for cycle-pointed constructions. 

Proposition 29. Let 5i be a species with a recursive specification over other species having a 
Polya-Boltzmann sampler. Then the random sampler TZ^{s\, t\; 8%, ti\ . . .), as given in Fiaure \TB. 
is a Polya-Boltzmann sampler for Jl. 

Proof. The arguments are very similar to the ones in the proof of Proposition 1251 For the cycle- 
pointed sum, ft = y + Q, wc have RSym(ft) = RSym(ft) + RSym(Q). Therefore the Polya- 
Boltzmann sampler has to be a Bernoulli switch with probability Zy/Zji followed by a call to 
the Polya-Boltzmann sampler of either ft or Q (depending on the Bernoulli ouput to be "true" or 
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(1) Cycle-pointed sequence. 

Algorithm TZseq° {si, S2, • • ■) : 

fci <— Geom(si); &2 <— Geom(si); {indep. calls} 
return a sequence of k\ + ki + 1 atoms (endowed with the identity-automorphism) 

where the atom at position k\ + 1 is marked. 



(2) Cycle-pointed (symmetric cycle-pointed, resp.) set. 

Given (si,ti);>i such that y *tj converges, define the distribution: 

t>l 

Pr{K) = — — — for K > 1 (Pr(X) = — — — for K > 2, resp.). 
Let R00T_CYCLE_SlZE(ti, *2, . . .) (RoOT_CYCLE_SlZE> 2 (ti , *2 , ■ • ■), resp.) be a generator for this distribution. 

Algorithm rZy Ex o (si,ti; S2, *2i ■ ■ ■) ( rZ SET g> (si, ti; 82, ta! • ■ ■), resp.) : 

A' <— RoOT_CYCLE_SiZE(ti , *2, . . .) (A <— Root_Cycle_Size> 2 (tl , *2 , ■ ■ ■), resp.); 

7 +- rZ SET (si,s 2 , .. .); 

Add to 7 a marked cycle of length A; 

return 7. 



(3) Cycle-pointed (symmetric cycle-pointed, resp.) cycle. 

Given (s;, ti)>i such that Z := Zq yc o (si, ti; S2, ti\ ■ • •) (Z := ^cyc® ( S1 > *!i S2 ' * 2 > ■ • •)) res P-) converges, 
consider the probability distribution 
1 t r 

Pr(i? = r) = — f{r) for r > 1 (r > 2, resp.). 

Z 1 — s r 

Let ReplicOrder(si, ti; S2, ti; .. .) be a generator of this distribution. 

Algorithm rZcvc ( s li *1! s 2, *2; • • •) ( rZ CYC <g (si, t\ ; S2, *2 i • ■ ■), resp.) : 
r i — ReplicOrder(si, ti; S2, t% ■ ■ •); 
j < — 1 + Geom (s r ); 

Draw an integer b G [l..r — 1] that is relatively prime to r uniformly at random ; 
return the cycle of length j X r with a marked atom, and endowed with the automorphism: 
"each atom is mapped to the atom that is j X 6 units further on the cycle" 
(the marked cycle is the automorphism-cycle containing the marked atom). 



(1') Cycle-pointed sequence of size A,-; denote £ := Seq'^l. 
Algorithm rZgo (si, t\\ S2, i2; • • ■) : 

return a sequence of k atoms (with the identity-automorphism) where one atom taken u.a.r. is marked 



(2') Cycle-pointed (symmetric cycle-pointed, resp.) set of size k; denote S := Set^. 

A marked (marked symmetric, resp.) partition sequence of order k is a sequence W = {£, m, 712 1 • • • , Wfc) 

such that I > 1 (£ > 2, resp.) and I + irii = k (one block is marked). 

The corresponding coefficient [t^s™ 1 . . - Su h \ is denoted coef^f(Zgo) (coef^f(Zgg>), resp.) 

Algorithm rZgo (si,ti; S2, *2; ■ • ■) ( rZ s@ (si,ti; S2, (2! . . .), resp.) : 
Draw a partition-sequence 7F of order k with probability: 

t/s? 1 sJ ! . . . s? k ■ coef-s^Zgo ) / tes? 1 Sn 2 . . . s? fc • coef^Zom ) \ 

Pr(50 = / , \ J (Pr(vf) = \\ . \ S , resp.) 

Zg° (si,ti; S2,t2! • • -J v Z s@) (si, ti; s 2 , *2i ■ ■ ■) ' 

return a collection of n± cycles of length 1, n.2 cycles of length 2, . . ., cycles of length k, 
together with a marked cycle of length £ (with a marked atom on it taken u.a.r.). 



(3') Cycle-pointed (symmetric cycle-pointed, resp.) cycle of size k; denote C := CycM. 

Algorithm rZ e o (si, ti; S2, t%\ . . .) ( rZ e@ (si, ti; S2, ti\ . . .), resp.) : 
Draw a divisor r of k (divisor r > 2 of k) with distribution 



Pr(r) = = I Pr(r) = = , resp. 

Ze° («i,*i;«3)*a; • • •) ^ Z e8 (si, ti; s 2 , <2; ■ ■ ■) 

Draw an integer b G [l..r — 1] that is relatively prime to r uniformly at random; 
return the cycle of length k with a marked atom and endowed with the automorphism: 
"each atom is mapped to the atom that is kb/r units further on the cycle". 



Figure 11. Polya-Boltzmann sampler for basic cycle-pointed species. In all 
these random samplers, the finally returned structure is made well-labeled using 
the procedure DistributeLabels. 
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Cycle-pointed sum: 7 = 7 + Q. 




Given (s^tiji^i such that Z p := Zy(s\,t\\ s 2 ,t 2 ; ■ ■ ■) and Z q := Z Q {si,ti;S2,h; ■ . .) 


converge: 


Algorithm rZ R (si, h\ s 2 , i 2 ; • • ■) : 




ifRprnfZ /Y"Z -l-"Z ^ then return TZmfq, Irs, t„- \ 

11 UI_j1 11 1 ZJ p / \ £J p \^ £J Q J ) 1j llcll let Lll 11 1_ Zj ^<-> 1 ; ''l; 2 ; "2 1 * ■ • / 




else return TZq(si, t\\ s 2 , i 2 ; . . .) end if 




Cycle-pointed product: ft = 7* B (analog for 31 = B * T) 




Given (s,,£j)i>i such that Z%(si,S2, ■ ■ ■) and Z;p(si,ii;s 2 ,i 2 ; ■ ■ •) converge: 




Algorithm rZ R (si, ii; s 2 , i 2 ; ■ • ■) : 




return (TZy(si, t\\ s 2 , t 2 ; . . .), TZ%(si, s 2 , . . .)) {independent calls} 




Cycle-pointed substitution: 3? = D 3 © 23. 




Given (s l ,t i ) i > 1 such that 6j := Z T ,(s l , s 2i , s 3l , . . .) and % := Z S o (s i: U; s 2l , t 2l ; . . .) converge 


for each i > 1, and such that Zy(b\, gi; 6 2 , g 2 ; . . .) converges: 




Algorithm rZ R (si, *i| s 2 , t 2 ; . . .) : 




Compute (P,<j P ,v) <- TZy(b\, qi; 6 2 , g 2 ; . . .); 




for each unmarked cycle C = (ui, . . . , Uk) of op (ui has smallest label in 


C) 


Compute (B, a B ) <- TZ^(s k , s 2fc , . . .); 




Replace each atom of C by a copy of B; 




for each cycle D of erg do 




Let _E be the cycle composed from the copies of D at u% , . . . , Uk', 




end for 




end for 




Let F= (yi, . . . , Vg) be the marked cycle of P (v± has smallest label in F); 




Compute (Q,crQ,q) <- TZ^o (sg, tg; s 2 e, t 2 f, ■ ■ • ); 




Replace each atom of F by a copy of Q; 




for each cycle G of oq do 




Let H be the cycle composed from the copies of G at v%, . . . , vi\ 




end for 




In the resulting structure R, mark the cycle composed from the copies of 




the marked cycle of Q; 




return (R,<jR,r), where cfr is the automorphism consisting of the cycles E 




and the cycles H, and where r is the atom q in the copy of the marked ( 


;ycle 


of Q substituted at v. 





Figure 12. The rules to specify a Polya-Boltzmann sampler for a cycle- 
pointed species assembled from other species using the cycle-pointed constructions 
{+5*, ©}■ In all these random samplers, the output structure is made well-labeled 
using the procedure DistributeL ABELS. 



"false"). (We work here with the refined weight i s™ 1 ^ 2 . . . s£* jn\ instead of the standard weight 
x n /n\, but the arguments given in [§] work the same way with these refined weights.) 

For the cycle-pointed product, ft = 7*1$, RSym(ft) is like a partitional product of RSym(J') 
and Sym(B). Therefore, a Boltzmann sampler for ft consists of two independent calls to Polya- 
Boltzmann samplers for 7 and for 23. (Again the only difference here with [3] is that we consider 
refined weights: t^s^s^ 2 ...s^ k /n\ for 7, and s™^ 2 ...s^ k /n\ for 23.) The arguments are the 
same for B * 7. 

For cycle-pointed substitution, 7 = 7 © 23, recall (from Equation (|2l)|) in the proof of Propo- 
sition [7]) that for each marked integer partition W, the sum of the weight- monomials over all the 
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symmetries on 31 whose core has type n = (£; n\, n 2 , . . . , n^) satisfies the expression 

Z£ = ^qtb^bz 2 . . .b^ k , where bi = Z^(si,s 2i , ■ ■ ■), qe = 2s» (s e , t e ; s 2t , t 2t ; ■ ■ .)• 

Hence, a Polya-Boltzmann sampler for 31 must draw the core structure following the Polya- 
Boltzmann distribution for 7 with parameters (&i, q±; b 2l q 2 ', ■ ■ ■)• I n addition, once the type tt 
of the core symmetry is fixed, the structures substituted at the cycles of the core automorphism 
form a partitional product of the form 

{RSym(3°) replicated I times} *Sym(B) ni *{Sym(2) duplicated}" 2 *• ••*{Sym(B) replicated k times}"* . 

Recall that a partitional product yields independent Boltzmann samplers. Hence, once the core 
symmetry (P, a) is drawn, the symmetries in 35 that are substituted at each cycle of a a must 
be independent calls of a Polya-Boltzmann sampler for 23, except for the marked cycle where we 
have to call a Polya-Boltzmann sampler for 23°. In addition, for an unmarked (marked, resp.) 
cycle, the parameters of TZ% (of TZ<%o, resp.) must be (si,s 2 i, ■ ■ ■) ((si,U; S2*)*2*i ■ ■ •)) r esp.) if 
the cycle has length i, as indicated by the expression of given above. This is precisely what 
the generator TZr does. □ 

5.3.4. Polya-Boltzmann samplers for decomposable cycle-pointed species. Similarly as for decom- 
posable species, the random generation rules shown in Figure [IT] (basic cycle-pointed species) and 
Figure [T^] (cycle-pointed constructions) can be combined to design a Polya-Boltzmann sampler for 
any species with a cycle-pointed recursive decomposition over basic species. We assume here again 
that an oracle provides the required evaluations of cycle-index sums and pointed cycle-index sums 
for the species appearing in the decomposition; and that the cost of drawing k under a specific 
integer distribution (such as ReplicOrder in TZcyc) has linear cost in k. 

Theorem 30. Any species 3? with a cycle-pointed recursive specification (Definition \3-4\ ! over 
species A\, . . . ,Ai having a linear Polya-Boltzmann sampler can be endowed with a linear Polya- 
Boltzmann sampler TZy(s\, t\] s 2 , t 2 ] . . .). 

Proof. Analogous to the proof of Theorem [251 D 

Consequently, the unrooted species we have encountered in Section [4] can be endowed with 
efficient random samplers. 

Proposition 31. In the real- arithmetic complexity model (oracle assumption), the following unla- 
beled species admit an exact-size sampler and an approximate- size sampler of expected complexities 
0(n 2 ) and 0(n/e) (n being the target-size, e the tolerance-ratio): unrooted nonplane (plane, resp.) 
trees, unrooted nonplane (plane, resp.) trees whose node degrees are constrained to lie in a finite 
integer set fl, unrooted cacti graphs, unrooted connected outerplanar graphs. 

Proof. The crucial point is that cycle-pointing is unbiased, hence finding an exact-size (approxi- 
mate-size, resp.) sampler for a species A is equivalent to finding one for the cycle-pointed species 
A°. 

For each of the unrooted tree species listed above, we have shown in Section 2] that the cor- 
responding cycle-pointed species is decomposable. If S is a species of connected graphs (closed 
under taking 2-connected components), the grammar (|48| given in Proposition [TBI ensures that 
the cycle-pointed species S° is decomposed in terms of the 2-connected graph species 35°, 35', 
and (35')°. For cacti graphs and outerplanar graphs, there is a decomposition strategy for the 2- 
connectcd subspecies (polygons for cacti graphs, dissections of a polygon for outerplanar graphs), 
which easily yields linear Polya-Boltzmann samplers for the species 23°, 23', and (23')°. Since S° 
is specified over these three species, there is also a linear Polya-Boltzmann sampler for 9°- 

Hence, for each unrooted species A stated above, there is a linear Polya-Boltzmann sampler for 
A°, which becomes a linear ordinary Boltzmann sampler when specializing to (s; = x l ,U = x l ). 
Moreover, the counting coefficients \A n \ obey the asymptotic form c p~ n n~ 5 / 2 , as shown in [33] 
for trees, [31] for cacti graphs, and [5] for outerplanar graphs. Therefore the coefficients |A° | obey 
the asymptotic form cp~ n n~ 3 / 2 . 
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Hence, by Proposition 1221 running the Boltzmann sampler TZa° at (s, = p z ,U = p 1 ) - with 
p the dominant singularity of A(x) - yields an exact-size (approximate-size, resp.) sampler for 
unlabeled structures from A with expected complexity 0(n 2 ) (0(n/e), respectively). □ 

5.4. Simplifications. 

5.4.1. Regarding the labels. Notice that the last step of all the random generation rules - as written 
in Figure O Q33 [HI and - is a call to a procedure that distributes distinct labels uniformly at 
random on the atoms of the output structure. A Polya-Boltzmann sampler for a decomposable 
species calls each of these random generation rules a certain number of times until the structure 
is completely build. It is however not necessary to call the label-distribution procedure at each 
step; one can just wait until the complete structure is build to distribute the labels. (If one is only 
interested in the underlying unlabeled structure of the final output, one can simply forget about 
any label-distribution procedure.) 

5.4.2. Regarding the cycles in automorphisms. We want to remark that one can simplify the ran- 
dom generation rules by forgetting about the cyclic order of the atoms in each part of the symme- 
tries (automorphisms), i.e., storing only the partition of the atoms induced by the automorphism. 
The composition of cycles then corresponds to merging the corresponding parts of the partition; 
and the line "draw an integer that is relatively prime to r" is deleted in the Polya-Boltzmann sam- 
plers for cycle-pointed species. These simplified random samplers output the profile of a symmetry 
drawn under the Polya-Boltzmann distribution (by profile we mean that the cyclic order of the 
atoms in each part is forgotten) . This is actually enough if one is just interested in the underlying 
unlabeled structure, which is mostly the case in practice, since Polya-Boltzmann samplers are used 
as a tool to design ordinary Boltzmann samplers. 

5.4.3. Specialization to ordinary Boltzmann samplers. Let Jbe a species with a recursive decom- 
position (cycle-pointed or not) over finitely many other species. Remark 12.11 and Remark 13.31 
ensure that, to compute the only species in the recursive specification for which it is neces- 
sary to know the cycle- index sum are the core species of the substitution operations in the recursive 
specification. For all the other species that appear in the recursive specification, it is enough to 
compute just the ordinary generating function. 

A similar remark holds for random generation. Namely, if one wants an ordinary Boltzmann 
sampler T3 r (x), it suffices to have Polya-Boltzmann samplers for the species that appear as core 
species of substitution operations in the recursive specification. For all the other species, an 
ordinary Boltzmann sampler is enough. 

5.5. Examples. To illustrate how our samplers (and the simplifications discussed above) operate 
in practice, we give here two examples: the species 51 of rooted nonplane trees, already treated 
in [13], and the species H of unrooted nonplane trees, also called free trees; here, one has to consider 
the associated cycle-pointed species, which is new. 

In order to obtain uniform random samplers for unlabeled structures from 51, we design an 
ordinary Boltzmann sampler for 51 via Polya-Boltzmann samplers. Recall that 5i is specified by 
51 = X • Set (51). Hence, according to Section T5.4.31 we need a Polya-Boltzmann sampler for 
Set in order to recursively specify an ordinary Boltzmann sampler T5l(x). Actually, according 
to the discussion in Section 15.4.21 we just need the profile of the automorphism computed by 
TZset- Given these simplifications and the definition of rZg ET (Figure^, we obtain the ordinary 
Boltzmann sampler T5i(x) shown in Figure 1131 

Let us now discuss the species of free trees 2r. We want to sample unlabeled structures from 3" 
uniformly at random. Since cycle-pointing is unbiased, it suffices to sample unlabeled structures 
from the cycle-pointed species 3"°. From the decomposition grammar for 3^° given in Proposi- 
tion [TTJ we can design an ordinary Boltzmann sampler via Polya-Boltzmann samplers. According 
to Section 15.4.31 the only species for which we need Polya-Boltzmann samplers (as refinements of 
ordinary Boltzmann sampers ) are the species Set, Set°, and Set®. Again, only the profile of the 
automorphisms returned by these samplers is necessary. Given the definition of FZsetj TZset , 
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(1) Rooted nonplane trees. 
Algorithm r5(x): 

J <- MAX_lNDEx(S(a;),K(a; 2 ), . . .); 

for j from 1 to J — 1 do kj <— Pois \Jl(x J )/j^ end for 

kj Pois>i (^R(x J )/ ; {Poisson conditioned to output a strictly positive integer} 
7 <— RootNode; 
for j from 1 to J do 

Tj T^x 3 ); 7 7 + {fcj copies of Tj pending from the root} end for 

return 7. 

(2) Cycle-pointed nonplane trees. 

[An auxiliary Boltzmann sampler for unlabeled cycle-pointed rooted nonplane trees] 

Algorithm T3i°(x) 

£ <- Root_Cycle_Size(2;S'(i), x 2 %'(x 2 ), . . .); 
r <- r$°(x*); t' <- rS(x); 

Attach (at their roots) £ copies of r and one copy of r' at a node 
return the resulting tree 

[The Boltzmann sampler for unlabeled cycle-pointed nonplane trees] 
Algorithm Y3°(x) 

if Bern(K(a;)/p(x)) return Y%x) 
else if Bern(x 2 R'(x 2 )/(p(x) - 
r «- rK°(x 2 ); 

return two copies of r attached at an edge 

else 

£ <- Root_Cycle_Size> 2 (x5'(x), x 2 %'(x 2 ), . . .); 
T <- T^°(x e ); t' <- r5(x); 

Attach (at their roots) I copies of r and one copy of r' at a node 
return the resulting tree 



Figure 13. Ordinary Boltzmann samplers for rooted nonplane trees and cycle- 
pointed nonplane trees. Taking x — p the singularity of 3?(x), we repeatedly call 
rD£(x) (r3 ro (a;), respectively) until we reach the target size; this yields an exact- 
size sampler of expected complexity 0(n ) and an approximate-size sampler of 
complexity 0{n/e) for rooted nonplane trees (for free trees, respectively). 

and TZ Set ® (Figure we obtain the ordinary Boltzmann sampler T3^° for unlabeled structures 
shown in Figure [T51 

Note that in these Boltzmann samplers, the only evaluations required are those of the series 
p(x), and of r(x % ) and i'r'(i') for any i > 1. As discussed in Section l5.1.2[ in practice we evaluate 
these series up to the power i = N / log 2 (l/p), where N is the precision (number of bits). 
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